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(54) Fermentative carotenold production 

(57) The present invention is directed to a DNA 
sequence comprising one or more DNA sequences 
selected from the group consisting of a DNA sequence 
which encodes the GGPP synthase of Ravobacterium 
sp. R1534 (crtE), a DNA sequence which encodes the 
prephytoene synthase of Flavobacterium sp. R1534 
(crtB), a DNA sequence which encodes the phytoene 
desaturase of Flavobacterium sp. R1534 (crtQ, a DNA 
sequence which encodes the lycopene cyclase of Fla- 
vobacterium sp. R1534 (crtY) or a DNA sequence which 
encodes the p-carotene hydroxylase of Ravobacterium 
sp. R1534 (crtZ) or DNA sequences which are substan- 
tially homdgous. vectors conrprlsing such DNA 
sequences and/or a DNA sequence which encodes the 
p-carotene p4-axygena8e of Alcaligenes strain PG-1 
(crt W) or a DNA sequence which is substantially 
homologous, cells which are transformed by such DNA 
sequences and/or vectors, a process for the preparation 
of a desired carotenoid or a mixture of carotenoids by 
cultering such transformed cells and a process for the 
preparation of a food or feed composition. 
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Description 

Over 600 different carotenoids have been described fr m carotenogenic rganisrre found among bacteria, yeast, 
fungi and plants. Currently only two of them, p-carotene arxl astaxanthin are commercially produced in microorganisms 
and used in the food arrd feed industry, p-carotene is obtained from algae and astaxanthin is produced in Pfaffia strains 
which have been generated by classical nnjtation. However, fermentation in Pfaffia has the disadvantage of long fer- 
mentation cycles and recovery from algae is aimbersoma Therefore it is desiderable to develop production systems 
which have better industrial applicability. e.g. can be manipulated for increased titers and/or reduced fermentation 
times. Two such systems using the biosynthetic genes form Erwinia herbicola and Envinia uredovora have already been 
described in WO 91/13078 and EP 393 690. respectively. Furthermore, three p-carotene ketdase genes (p-carotene p- 
4-oxygenase) of the marine bacteria Agrobacterium aurantfacum and Alcaligenes strain PC-I (crtW) [Misawa. 1995, 
Biochem. Biophys. Res. Com. 209. 867-876][Misawa, 1995, J. Bacteriology 122. 6575-6584] and from the green algae 
Haematococcus ptuviaJis (bkt) ILotan, 1995, FEBS Letters 2y54, 125-128]! Kajiwara. 1995, Plant Md. Biol. 29, 343-352] 
have been cloned £ coii carrying either the carotenogenic genes (crtE. crtB. crtY and crti) of E. herbicola [Hundle, 
1 994, MQQ 245. 406-41 6] or of 5. uredovora and complemented with the crtW gene of A aurantiacum [Misawa, 1 995] 
or the bkt gene of H. pluvial/s [Lotan. 1995][Kairwara, 1995] resulted in the accumulation of canthaxanthin (p.p-caro- 
t©ie-4,4'-dione). originating from the conversion of p-carotene, via the intemiediate echinenone (p,p-carotene-4-one). 
Introduction of the above mentioned genes (crtW or bkt) into E coli cells hartx)uring besides the carotenold biosynthe- 
sis genes mentioned above also the crtZ gene of E. uredovora [Kajiwara. 1995][Misawa, 1995]. resulted in both cases 
in the accumulation of astaxanthin (3,3*-dihydrQxy-p,p-carotene-4.4'<lione). The results obtained with the bkt gene, are 
in contrast to the observation made by others [Lotan, 1995], who using the same experimental set-up. but introducing 
the H. pluvia/is bkt gene in a zeaxarthin {p,p-carotene-3.3*-diol) synthesising E coli host hartwuring the carotenokJ bio- 
synthesis genes of E herbicola, a close relative of the above mentioned E uredoy/ora strain, did not observe astaxan- 
thin production. 

However, functionally active combinations of the carotenoid biosynthesising genes of the present invention with the 
known crtW genes have rK>t been shown so far and even more importantly there is a continuing need in even more opti- 
mized fermentation systems for industrial application. 

It is th^efore an object of the present invention to provide a DNA sequence comprising one or more DNA 
sequences selected from the group consisting of: 

a) a DNA sequence which encodes the GGPP synthase of Flavobacterium sp. R1534 (crtE) or a DNA sequence 
which is substantially homologous; 

b) a DNA sequence which encodes the prephytoene synthase of Flavobacterium sp. R1534 (crtB) or a DNA 
sequence which is substantially homologous; 

c) a DNA sequence which encodes the phytoene desaturase of Flavobacterium sp. R1534 (crtQ or a DNA 
sequence which is substantially homologous; 

d) a DNA sequence which encodes the lyccpene cyclase of Flavobacterium sp. R1534 (crtY) or a DNA sequence 
which is sUsstantially homologous; 

e) a DNA sequence which encodes the p-carotene hydroxylase of Ravobacterium sp. R1534 (crtZ) or a DNA 
sequence which is substantially homologous. 

It is also an object of the present invention to provide a vector comprising such DNA sequence, preferably in the 
form of an expression vector. Furthermore it is an object of the present invention to provide a cell which is transformed 
by such DNA sequence or vector, preferably which is a prokaryotic cell and nrare preferably which is E. coli or a Bacillus 
strain. Such transformed cell which is an eukaryotic cell, preferably a yeast cell or a fungal cell, is also an object of the 
present invention. Rnally the present invention concerns also a process for the preparation of a desired carotenoid or 
a mixture of carotenoids by culturing such a cell under suitable culture concfitions and isolating the desired carotenoki 
or a mixture of carotenoids from such cells or the culture medium and. in case only one carotenoid is desired separating 
it by methods known in the art from other carotenoids which might also be present and a process for tiie preparation of 
a food or feed composition characterized therein that after such a process has been effected the carotenoid or carote- 
noid mixture is added to food or feed. 

Furthermore, a DNA sequence comprising the following DNA sequences is an object of tiie present invention: 

a) a DNA sequence which encodes the GGPP synthase of Flavot)aclerium sp. R1534 (crtE) or a DfMA sequence 
which is substarnially homologous, and 
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b) a DNA sequence which encodes the prephytoene synthase of Ravobacterium sp. R1534 (crtB) of a DNA 
sequence which is substantially homolog us. and 

c) a DNA sequence which encodes the phytoene desaturase of Flavobacterlum sp. R 1534 (crti) or a DNA 
s sequence which is substantially homologous. 

It is also £in object of the present invention to provide a vector comprising such DNA sequence, preferably in the 
form of an expression vector Furthermore it is an object of the present invention to provide a cell which is transformed 
by such DNA sequence or vector, preferably which is a prokaryotic cell and more preferably which is E. coli or a Bacillus 
10 strain. Such transformed cell which is an eukaryotic cell, preferably a yeast cell or a fungal cell, is also an object of the 
present invention. Finally the present invention concerns also a process for the preparation of a desired carotenoid or 
a mixture of carotenoids by culturing such a cell under suitable conditions and isolating the desired carotenoid or a mix- 
ture of carotenoids from such cells or the culture medium and, in case only one carotenoid is desired separating it by 
methods known in the art from other carotenoids which might also be present, preferably such a process for the prep- 
15 aration of lycopene and a process for the preparation of a food or feed composition characterized therein that after such 
a process has been effected the carotenoid, preferably lycopene or carotenoid mixture, preferably a lycopene compris- 
ing mixture is added to food or feed. 

Furthermore a DNA sequence corrprising the following DNA sequence is also an object of the present invention: 

20 a) a DNA sequence which ertcodes the GGPP synthase of Flavobacterlum sp. R1534 (crtE) or a DNA sequence 
which is substantially homologous, and 

b) a DNA sequence which encodes the prephytoene synthase of Flavobacterlum sp. R1534 (crtB) or a DNA 
sequence which is substantially homologous, and 

25 

c) a DNA sequence which encodes the phytoene desaturase of Ravobacterium sp. R1534 (crtI) or a DNA 
sequence which is substantially homologous, and 

d) a DNA sequence which encodes the lycopene cyclase of Ravobacterium sp. R1534 (crtY) or a DNA sequence 
30 which is substantially homologous. 

it is also an object of the present invention to provide a vector comprising such DNA sequence, preferably in the 
form of an expression vector. Furthermore it is an object of the present invention to provide a cell which is transformed 
by such DNA sequence or vector, preferably which is a prokaryotic cell and more preferably which is E. coli or a Bacillus 

35 Strain. Such transformed cell which is an eukaryotic cell, preferably a yeast cell or a fungal cell, is also an object of the 
present invention. Finally the present invention concerns also a process for the preparation of a desired carotenoid or 
a mixture of carotenoids by culturing such a cell under suitable conditions and isolating the desired carotenoid or a mix- 
ture of carotenoids from such cells or the culture medium and, in case only one carotenoid is desired separating it by 
methods known in the art from other carotenoids which might also be present, preferably such a process for the prep- 

40 aration of p-carotene and a process for the preparation of a food or feed composition characterized therein that after 
such a process has been effected the carotenoid. preferably p-carotene or carotenoid mixture, preferably a p-carotene 
comprising mixture is added to food or feed. 

Furthermore a cell which is transformed by the above mentioned DNA sequence comprising subsequences a) to 
d) or the vector comprising it and a second DNA sequence which encodes the p-carotene p4-oxygenase of Alcaligenes 

45 strain PC-1 (crt W) or a DNA sequence which is substantially homologous or a secorxj vector which comprises a DNA 
sequence which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is 
substantially homologous; and a process for the preparation of a desired carotenoid or a mixture of carotenoids by cul- 
turing such cell under suitatsle conditions and isolating the desired carotenoid or a mixture of carotenoids from such 
cells or the culture medium and, in case only one carotenoid is desired separating it by methods known In the art from 

so other carotenoids which might also be present, preferably such a process for the preparation of echinenone and a proc- 
ess for the preparation of a food or feed composition characterized therein that after such a process has been effected 
the carotenoid. preferably echinenone or carotenoid mixture, preferably an echinenone comprising mixture is added to 
food or feed. 

Furthermore it is an object of the present invention to provide a DNA sequence as mentioned above conrprising 
65 subsequences a) to d) and a Dt^ sequence which encodes the p-carotene p4-oxygenase of Alcaligenes sti-ain PC-1 
(crt W) or a DNA sequence which is substantially homologous and a vector comprising such DNA sequence, preferably 
in the form of an expression vector. Furthermore it is an object of the present invention to provide a cell which Is trans- 
formed by such DNA sequence or vector, preferaksly which is a prokaryotic cell and more preferably which is E. coli or 
a Bacillus strain. Such transformed cell which is an eukaryotic cell, preferably a yeast cell or a fungal cell, is also an 
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object of the present Invention. Rnally the present invention concerns also a process for the preparation of a desired 
carotenoid or a mixture of carotenoids by cufturing such a cell under suitable culture conditions and isolating the desired 
carotenoid or a mixture of carotenoids from such cells of the cultur medium and. in case only ne carotenoid is desired 
separating it by methods known in the art from other carotenoids which might also be present espedally suc^ a proc- 
5 ess for the preparation of echinenone or canthaxanthin and a process for the preparation of a food or feed compositing 
characterized therein that after such a process has been effected the carotenoid, preferably echinenone or canthaxan- 
thin or carotenoid mixture, preferably a echinenone or canthaxanthin containing n^xture is added to food or feed. 
Furthermore a DNA sequence comprising the following DNA sequences is also an object of the present invention: 

10 a) a DNA sequence which encodes the GGPP synthase of Ravobacterium sp. R1534 (crtE) or a DNA sequence 
which is substantially homologous, and 

b) a DNA sequence which encodes the prephytoene synthase of Ravobacterium sp. R1534 (crtB) or a DNA 
sequence which is substantially homdo^us. and 

IS 

c) a DNA sequence which encodes the phytoene desaturase of Ravobacterium sp. R1534 (crtQ or a DNA 
sequence which is substantially homologous, and 

d) a DNA sequence which encodes the lycopene cyclase of Ravobacterium sp. R1534 (crtY) or a DNA sequence 
20 which is substantially homologous, and 

e) a DNA sequence which encodes the p-carotene hydroxylase of Ravobacterium sp. R1534 (crtZ) or a DNA 
sequence which is sd3Stantially homologous. 

25 It is also an object of the present invention to provide a vector conprtsing such DNA sequence, preferably in the 
form of an expression vector. Furthermore it is an object of the present invention to provide a cell which is transformed 
by such DNA sequence or vector, preferably which is a proKaryotic cell and more preferably which is E. coli or a Bacillus 
strain. Such transformed cell which is an eukaryotic cell, preferat>ly a yeast cell or a fungal cell, is also an object of the 
present invention. Rnally the present invention concerns also a process for the preparation of a desired carotenoid or 

30 mixture of carotenoids by culturing such a cell under suitable conditions and isolating the desired carotenoid or a mix- 
ture of carotenoids from such cells of the culture medium and. in case only one carotenoid is desired separating it by 
methods known in the art from other carotenoids which might also be present, preferably such a process for the prep- 
aration of zeanxarrthin and a process for the preparation of a food or feed composition characterized therein that after 
such a process has been effected the carotenoid, preferably zeaxanthtn or the carotenoid mixture, preferably a zeaxan- 

35 thin containing mixture is added to food or feed. 

Furtiiermore a DNA sequence as mentioned above comprising subsequences a) to e) and in addition a DNA 
sequence which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) of a DNA sequence which is 
substantially homologous is an object of the present inverrtion and to provide a vector comprising such DNA sequence, 
preferably in form of an expression vector. Furtiiermore it is an object of the present invention to provide a ceil which is 

40 transformed by such DNA sequence or vector, preferably which is a prokaryotic cell and more preferably which is E. coli 
or a Bacillus strain. Such transformed cell which is an eukaryotic cell. preferat>ly a yeast cell or a fungal cell, is also an 
object of the present invention. Rnally the present invention concerns also a process for the preparation of a desired 
carotenoid or a mixture of carotenokls by culturing such a cell under suitable culture conditions and isolating the desired 
carotenoid or a mixture or carotenoids from such cells of the culture medium arKl. in case only one carotenoid is desired 

45 separting it by methods known in the art from otiier carotendds which might also be present preferably such a process 
for the preparation of zeaxantiiin, adonixanthin or astaxanthin and a process for the preparation of a food or feed com- 
position characterized therein tiiat after such a process has been effected the carotenoid. preferably zeaxantiiin. adon- 
ixanthin or astaxanthin or carotendd mixture, preferably a zeaxanthin. adonixanthin or astaxanthin containing mixture 
is added to food or feed. 

so Furthermore a ceil which is transformed by tiie DNA sequence mentioned above comprising sut^equences a) to 
e) or a vector comprising such DNA sequence and a second DNA sequence which encodes the p-carotene p4-0Kyge- 
nase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is sut)stamially homologous or a second vector which 
comprises a DNA sequence which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) or a DNA 
sequence which is sut}stantially homologous is also an object of the present invention and a process for the preparation 

ss of a desired carotenoid or a mixture of carotenoids by culturing such a cell under suttat}le conditions and isolating the 
desired carotenoid or a mixture of carotenoids from such cells or the culture medium, and in case only one carotenoid 
is desired separating it by methods known in the art from other carotendds which might also be present, preferably 
such a process for the preparation of zeanxanthin or adonixanthin and a process for the preparation of a food or feed 
composition characterized therein that after such a process has been effected the carotemsid. preferably zeaxanthin or 
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adontxanthin or carotenoid mixture, preferably a zeaxanthin or adonixanthin containing mixture is added to food or feed. 

In this context it should be mentioned that the expression "a DNA sequence is substantially homologous** refers 
with respect to the crtE encoding DNA sequence to a DNA sequence which encodes an amino acid sequence which 
shows more than 45 %. preferably more than 60 % and more preferably more than 75 % and most preferably more than 

5 90 % identical amino acids when compared to the amino acid sequence of crtE of Flavobacterium sp. 1534 and is the 
amino acid sequence of a polypeptide which shows the same type of enzymatic activity as the enzyme encoded by crtE 
of Ravobacterium sp. 1534. In analogy with respect to crtB this means more than 60 %. preferably more than 70 %, 
more preferably more than 80 % and most preferably more than 90 %; with respect to crtl this means more than 70 
preferably more than 80 % and most preferably more than 90 %; with respect to crtY this means 55 %. preferably 70 %. 

10 more preferably 80 % and most preferably 90 %; with respect to crtZ this means more than 60 %, preferably 70 %, more 
preferably 80 % and most preferably 90 %; with respect to crt W this also means more than 60 %. preferably 70 more 
preferably 80 % and most preferably 90 %. Sequences which are substantially homologous to crt W are known, e.g. in 
form of the p-carotene p4-axygenase of Agrobacterium aurantiacum or the green algae Haematococous pluvialis (bkt). 
DNA sequences in form of genomic DNA. cDNA or synthetic DNA can be prepared as known in the art [see e.g. 

IS Sannbrook et al.. Molecular Cloning. Cold Spring Habor Laboratory Press 1989] or, e.g. as specifically described in 
Examples 1 , 2 or 7. 

The cloning of the DNA-sequences of the present invention from such genomic DNA can tiian be effected, e.g. by 
using the well known polymerase chain reaction (PCR) method. The prindples of this method are outlined e.g. in PCR 
Protocols: A guide to Methods and Applications, Academic Press. Inc. (1990). PCR is an in vitro method for producing 

20 large amounts of a specific DNA of defined length and sequence from a mixture of different DNA-sequences. Thereby. 
PCR is based on the enzymatic amplification of the specific DNA fragment of interest which is flanked by two oligonu- 
cleotide primers which are specific for this sequence and which hybridize to the opposite strand of the target sequence. 
The primers are oriented with their 3* erxis pointing toward each other. Repeated cycles of heat denaturation of the tem- 
plate, annealing of the primers to their complementary sequences and extension of the annealed primers with a DNA 

2S polymerase result in the amplification of the segment between the PCR primers. Since tiie extension product of each 
primer can serve as a template for tiie otiier, each cycle essentially dout>Ies the amount of the DNA fragment produced 
in tiie previous cycle. By utilizing the thermostable Taq DNA polymerase, isolated from the thermophilic bacteria Ther- 
mus aquaticus. it has been possit)le to avoid denaturation of the polymerase which necessitated the addition of enzyme 
after each heat denaturation step. This development has led to tiie automation of PCR by a variety of simple tempera- 

30 ture-cycling devices. In addition, the specificity of the amplification reaction is increased by allowing the use of higher 
temperatures for primer annealing and extension. The increased specificity improves the overall yield of amplified prod- 
ucts by minimizing the competition by non-target fragments for enzyme and primers. In this way the specific sequence 
of interest is highly amplified and can be easily separated from the non-specific sequences by methods known in the 
art. e.g. by separation on an agarose gel and cloned by methods known in the art using vectors as descried e.g. by 

35 Holten and Graham in Nucleic Acid Res. 13. 11 56 (1991), Kovalic et. al. in Nucleic Acid Res. 12. 4560 (1991). Marchuk 
et al. In Nucleic Acid Res. 19. 1 154 (1991) or Mead et al. in Bio/rechnology 9. 657-663 (1991). 

The oligonucleotide primers used in the PCR procedure can be prepared as known in the art and described e.g. in 
Sambrook et al.. s.a. 

Amplified DNA-sequences can than be used to screen DNA libraries by metiiods known in the art (Sambrook et al.. 

40 s.a.) or as specifically described in Examples 1 and 2. 

Once complete DNA-sequences of the present invention have been obtained they can be used as a guideline to 
define new PCR primers for the cloning of substantially homologous DNA sequences from other sources. In addition 
they cind such homologous DNA sequences can be integrated into vectors by methods known in the art and described 
e.g. in Sambrook et al. (s.a.) to express or overexpress the encoded polypeptide(s) in appropriate host systems. How- 

45 ever, a man skilled tn the art knows that also the DNA-sequences themselves can be used to transform the suitatsle host 
systems of the invention to get overexpression of tiie encoded polypeptide. Appropriate host systems are for example 
Bacteria e.g. E. coli. Bacilli as. e.g. Bacillus subtilis or Flavobacter strains. E. coll. which could be used are E. coli K12 
strains e.g. M15 [described as DZ 291 by Villarejo et al. in J. Bacteriol. 12Q. 466-474 (1974)]. HB 101 [ATCC No. 33694] 
or E. cdi SGI 3009 [Gottesman et al.. J. Bacteriol. 148. 265-273 (1 981)]. Suitable eukaryotic host systems are for exam- 

50 pie fungi, like Aspergilli. e.g. Aspergillus niger [ATCC 9142] or yeasts, like Saccharomyces, e.g. Saccharomyces cere- 
visiae or Pichia, tike pastoris, all available from ATCC. 

Suitable vectors which can be used for expression in E. coli are mentioned, e.g. by Sambrook et al. [s.a.] or by Rers 
et al. in Procd. 8th Int. Biotechnology Symposium" [Soc. Franc, de Microbiol., Paris (Durand et al., eds.), pp. 680-697 
(1988)] or by Bujard et al. in Methods in Enzymology. eds. Wu and Grossmann, Academic Press. Inc. Vol. 155. 416-433 

55 (1987) and StOber et al. in Immunological Metiiods. eds, Lefkovits and Pernis, Academic Press. Inc.. Vol. IV. 121-152 
(1990). Vectors which could be used for expression in Bacilli are known in the art and described, e.g. in EP 405 370. 
EP 635 572 Procd. Nat Acad. Sci. USAfiL 439 (1984) by Yansura and Henner, Meth. Enzym. 185. 199-228 (1990) or 
EP 207 459. Vectors which can be used for expression in fungi are known in the art and described e.g. in EP 420 358 
and for yeast in EP 183 070, EP 183 071 . EP 248 227. EP 263 31 1 . 
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Once such DNA-sequences have been expressed in an appropriate host cell in a suitable medium the caroten t6s 
can be isolated either from the medium in the case they are secreted into the medium or from the host organism and, 
if necessary separated from other carotenoids if present in case one specific carotenotd is desired by methods known 
in the art (see e.g. Carotenoids Vol lA: Isolation and Analysis, G. Britton, S. Liaaen^ensen, H. Pfander; 1995, 
5 Birkhduser Verlag. BaseQ. 

The carotenoids of the present invention can be used in a process for the preparation of food or feeds. A man 
skilled in the art is familiar with such process. Such compound foods or feeds can further comprise additives or compo- 
nents generally used for such purpose and known in the state of the art. 

After the invention has been described in general hereinbefore, the following figures and examples are intended to 
10 illustrate details of the invention, without thereby limiting it in any matter. 

Raure 1 : The biosynthesis pathway for the formation or carotenoids of Flavobactenum sp. R1534 is illu^rated 
explaining the enzymatic activities which are encoded by DNA sequences of the present invention. 



15 fMQure 2 : Southern blot of genomic Flavobacterium sp. R1 534 DNA digested with the restriction enzymes shown 
on top of each lane and hybridized with Probe 46F. The arrow indicated the isolated 2.4 kb Xhol/PstI 
fragment 

Figure 3 : Southern blot of genomic Flavobacterium sp. R1534 DNA digested with Clal or double digested with 
20 Clal and Hindlll. Blots shown in Panel A and B were hybridized to probe A or probe B. respectively (see 

examples). Both Clal/Hindlll fragments of 1.8 kb and 9.2 kb are Indicated. 



25 



Figure 4: Southern blot of genomic Flavobacterium sp. R1534 DNA cSgested with the restriction enzymes shown 
on top of each lane and hybridized to probe C. The isolated 2.8 kb Sal 1/Hindlll fragment is shown by 
the arrow. 



30 



noure 5 : Southern blot of genomic Flavobacterium sp. R1534 DNA digested with the restriction enzymes shown 
on top of each lane and hybrkiized to probe D. The isolated Bcll/SphI fragment of approx. 3 kb is shown 
by the arrow. 

Figure 6 : Physical map of the organization of the carotenoki biosynthesis cluster in Ravobacterium sp. R1534, 
deduced from the genomic clones obtained. The location of the probes used for the screening are 
shown as bars on the respective clones. 



35 Rgure 7 : Nucleotide sequence of the Flavobacterium sp. R1534 carotenokl biosynthesis cluster and its flanking 
regions. The nucleotide sequence is numbered from the first nucleotide shown (see BamHI site of Rg. 
6). The deduced amino acid sequence of the ORF's (orf-5p ori-1, crtE, crtB, crti, crtY crtZ and orf-16) 
are shown with the single-letter amino acid code. Arrow (->) indicate the direction of the transcription; 
asterisks, stop codons. 



45 



so 



Roure 8 : Protein sequence of the C3GPP synthase (crtE) of Flavobacterium sp. R1534 with a MW of 31331 Da. 

Roure 9 : Protein sequence of the prephytoene synthetase (crtB) of Flavobacterium sp. R1534 with a MW of 
32615 Da. 

Rgure 10 : Protein sequence of the phytoene desaturase (crtl) of Flavobacterium sp. R1 534 with a MW of 5441 1 
Da. 

Rqure 1 1 : Protein sequence of the lycopene cyclase (crtY) of Flavobacterium sp. R1 534 with a MW of 42368 Da. 

Roure 12 : Protein sequence of the p-carotene hydroxylase (crtZ) of Flavobacterium sp. R1534 with a MW of 
19282 Da. 



Raure 13 : Recombinant plasmids containing deletions of the Flavobacterium sp. R1534 carotenoid biosynthesis 
55 gene cluster. 

Raure 14 : Primers used for PCR reactions. The underiined sequence is the recognition site of the indicated restric- 
tion enzyme. Small caps Indicate nudeotkies infroduced by mutag nests. Boxes show th artificial RBS 
which is recognized in B. sut>tilis. Small caps in bold show the location of the original adenine aeating 
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Figure 1 5 : 

10 

Figure 16: 

RqyrQ 17; 
IS FiqMre 19: 
Figure 19 : 
Figure 20 



nqure21 



ngure 22: 
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Figure 23: 
Figure 24: 

40 

Figure 2^: 
4S Rgure 26 : 

50 Figure 27 : 



the translation start site (ATG) f the fbllcwing gene (see original operon). All the ATG's f the original 
Ftavobacter carotenoid biosynthetic genes had to be destroyed to not interfere with the rebuild transcrip- 
tion start site. Arrows indicate start and ends of the indicated Ravobacterium R1534 WT carotenoid 
genes. 

Linkers used for the different constructions. The underlined sequence is the recognition site of the indi- 
cated restriction enzyme. Small caps indicate nucleotides introduced by synthetic primers. Boxes show 
the artificial RBS which is recognized in B. subtilis. Arrow indicate start and ends of the indicated Flavo- 
bacterium carotenoid genes. 

Costruction of plasmids pBIIKS{+)-Gtone59-2. pLyco and pZea4. 
Construction of plasmid p602CAR. 

Construction of plasmids pBIIKS(+)-CARVEQ-E and p602 CARVEQ-E. 
Construction of plasmids pHP13-2CARZYIB-EINV and pHP13-2PN252YIB-EINV. 
Construction of plasmid pXI12-2YIB-EINVMUTRBS2C. 

Norhern blot analysis of B. subtilis strain BS1012::ZYIB-EirsfV4. Panel A: Schematic representation of a 
reciprocal integration of plasmid pXI12-ZYIB-EINV4 into the ievan-sucrase gene of B-subtilis. Panel B: 
Northern blot obtained with probe A (PCR fragment which was obtained with CAR 51 and CAR 76 and 
hybridizes to the 3' end.of crtZ and the 5* end or crtY). Panel C: Northern blot obtained with probe B 
(BamHI-XhoI fragment isolated from plasmid pBIIKS(+)-crtE/2 and hybridizing to the 5' part of the crtE 
gene). ■ 

Schematic representation of the integration sites of three transformed Bacillus subtilis strains: 
BS1012::SFCO. BS1012::SFCOCAT1 and BA1012::SFCONEO1. Amplification of the synthetic Ravo- 
bacterium carotenoid operon (SFCO) can only be obtained in those strains havirig ampfifiable struc- 
tures. Probe A was used to determine the copy nunlber of the integrated SFCO. Erythromycine 
resistance gene (ermAM), chloramphenicol resistance gene (cat), neomycine resistance gene (neo). 
terminator of the cryT gene of B. subtilis (cryT). Ievan-sucrase gene (sac-B 5* and sac-B 37. plasmid 
sequences of pXI12 (pXI12). promoter originatir^ from site I of the veg promoter complex (Pvegl). 



Construction of plasmids 
EINV4MUTRBS2CCAT. 



pXn 2-ZYIB-EINV4MUTRBS2CNEO and pXII 2-2YIB- 



Complete nucleotide sequence of plasmid pZea4. 

Synthetic crtW gene of Alcaligenes PC-1 . The translated protein sequence is shown ak)ove the double 
strarxled DNA sequence. The twelve oligonucleotides (crtW1-crtW12) used for the PCR synthesis are 
underlined. 

Construction of plasmid pBIIKS-crtEBIYZW. The Hindlll-Pml! fragment of pALTER-Ex2-crtW. carrying 
the syntiietic crtW gene, was cloned into the IHindltl and Mlul (blunt) sites. Pvegl and Ptac are the pro- 
moters used for the transcription of the two opera. The C0IEI replication origin of tNs plasmid is com- 
patible with the p15A origin present in the pALTER-Ex2 constructs. 

Relevant inserts of all plasmids oonstucted in Example 7. Disrupted genes are shown by//. Restriction 
sites: S^Sacl. X^^XbaL H^Hindlll, N»Nsil. Hp»Hpal. Nd^Ndel. 



Roure 28 : Reaction products (carotenoids) obtained from p-carotene by the process of the present invention. 
55 Example 1 

Materials and general m ethods used 

Bacterial strains and plasmids: Fiavobacterium sp. R1534 WT (ATCC 21588) was the DNA source for tiie genes 
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cloned. Partial geiorrdc libraries of Ffavobacterium sp. R1 534 WT DNA were constructed into the pBluescriptlI+(KS) or 
(SK) vector (Stratagene, La Jolla, USA) and transformed into E, colt XL-1 blue (Stratagene) or JM109. 

Media and growth conditions: Transformed E. coU were grown in Luria broth (LB) at 37^ C with 1 0O^g Annpidllin 
(Amp)/ml for selection. Fiavobacterium sp. R1 534 WT was grown at 27** C in medium containing 1% glucose, 1% tryp- 
5 tone (Difco Laboratories), 1% yeast extract (Drfco). 0.5% MgS04 7H2O and 3% Nad. 

Colony screening: So-eening of the E. colt transformants was done by PCR basically according to the method 
described by Zon et al. [Zon et al.. BioTechniques Z. 696-698 (1989)] using the following primers: 

Primer #7: 5--CCTGGATGACX3TGCTGGAATATrCC-3* 

Primer #8: y-CAAGGCCCAGATCGCAGGCG-3* 

Genomic DNA: A 50 ml overnight culture of Ffavobacterium sp. R1 534 was centrifuged at 1 0,000 g for 1 0 minutes. 
The pellet was washed briefly with 10 ml of lysis buffer (50 mM EDTA, 0.1M NaCI pH7.5). resuspended in 4 ml of the 

IS same buffer sumplemented with 10 mg of lysozyme and incubated at 37**C for 15 minutes. After addition of 0.3 ml of N- 
Lauroyl sarcosine (20%) the incubation at 37^C was continued for another 15 minutes before the extraction of the DNA 
with phenol, phenol/chloroform and chloroform. The DNA was ethanol precipitated at room temperature for 20 minutes 
in the presence of 0.3 M sodium acetate (pH 5.2), followed by centrifugation at 10.000 g for 15 minutes. The pellet was 
rinsed with 70% ethanol, dried and resuspended in 1 ml of TE (10 mM Tris, ImM EDTA. pH 8.0). 

20 All genomic DNA used In the southem blot analysis and cloning experiments was dialysed against H2O for 48 
hours, using collodium bags (Sartorius. Germany), ethanol precipitated in the presence of 0.3 M sodum acetate and 
resuspended in H2O. 

Prot>e labelling: DNA probes were labeled with (a-^^P) dQTP (Amersham) by random-priming according to [Sam- 
brook et al.. 8.a.]. 

25 Probes used to screen the mini-libraries: Probe 46F is a 1 1 9 bp fragment obtained by PCR using primer #7 and 
#8 and Ffavobacterium sp, R1534 genomic DNA as template. This probe was proposed to be a fragment of the Fiavo- 
bacterium sp. R1534 pf^oene synthase (crtB) gene, since it shows significant homology to the phytoene synthase 
genes from other species (e.g. £ uredovora, E. herbicola). Probe A is a BstXI - PstI fragment of 1 84 bp originating from 
the right arm of the insert of done 85. Probe B is a 397 bp Xhol • NotI fragment obtained from the left end of the insert 

30 of clone 85. Probe C is a 538 t}p Bglll - PstI fragment from the right end of the insert of done 85. Probe D is a 376 bp 
Kpnl - BstYl fragment isolated from the insert of done 59. The localization of the individual probes is shown in figure 6. 

Oligonucleotide synthesis: The oligonudeotides used for PCR reactions or for sequencing were synthesized with 
an Applied Biosystems 392 DNA synthesizer. 

Southem blot analysis: For hybridization experiments Fiavobacterium sp. R1534 genomic DNA (3 (ig) was 

35 digested with the appropiate restriction enzymes and electrqphoresed on a 0.75% agarose gel. The transfer to Zeta- 
Probe blotting membranes (BIO-RAD). was done as descn'bed [Sourthern, E.M., J. Mol. Bid. 93. 503 (1975)]. Prehy- 
bridization and hybridization was in 7%SDS, 1% BSA (fraction V; Boehringer), O.SM Na2HP04, pH 7.2 at 65°C. After 
hybridization tiie membranes were washed twice for 5 minutes in 2x SSC, 1% SDS at room temperature and twice for 
15 minutes in 0.1% SSC. 0.1% SDS at 65** C. 

40 DNA sec|uence analysis: The sequence was determined by the dideoxy chain termination technique [Sanger et 
al., Proc. Nati. Acad. Sd. USA Z4> 5463-5467 (1977)] using tiie Sequenase Kit (United States BiochmnicaO. Both 
strands were completely sequenced and the sequence analyzed using the GCG sequence analysis software package 
(Version 8.0) by Genetics Computer. Inc. [Devereux et al.. Nucleic Acids. Res. 12. 387-395 (1984)]. 

Analysis of carotenolds: E. coli XL-1 or JM109 cells (200 - 400 mQ canning different plasmid consti-ucts were 

45 grown for the times indicated in the text, usually 24 to 60 hours, in LB suplemented with 100)ig Arrpidllin/ml. in shake 
flasks at 37** C and 220 rpm. 

The carotenoids present in the microorganisms were extracted with an adequate volume of acetone using a rota- 
tion homogenizer (Polytron. Klnematica AG. CH-Luzern). The homogenate was the ffltered through the sintered glass 
of a suction fitter into a rourxJ bottom flask. The filtrate was evaporated by means of a rotation evaporator at 50^ C using 

so a water-jet vacuum. For the zeaxanthin detection the residue was dissolved in n-hexane/acetone (86:1 4) before analy- 
sis with a normalphase HPLC as described in [Weber, S. in Analytical Methods for Vitamins and Carotenoids in Feed. 
Keller. H.E. . Editor, 83-85 (1988)]. For the detection of p-carotene and lycopene the evaporated extract was dissolved 
in n-hexane/acetone (99:1) and analysed by HPLC as desaibed in [Hengartner et al., IHelv. Chim. Acta 25, 1848-1865 
(1992)]. 

55 
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Example 2 

Cipninq of the f /^ypfracKerftim ^Pi R1 934 pprQtQn |t< bio^vnthPttp genes, 

5 To identify and isolate DNA fragments carrying the genes of the carotenoid biosynthesis pathway, we used the DNA 
fragment 46F (see methods) to probe a Southern Blot candying chromosomal DNA of Flavobacterium sp. R1534 
digested with different restriction enzymes Fig. 2. The 2.4 kb Xhol/PstI fragment hybridizing to the probe seemed the 
most appropiate one to start with. Genomic Flavobacterium $p, R1534 DNA was digested with Xhot/PstI and run on a 
1% agarose gel. According to a comigrating DNA marker, the region of about 2.4 kb was cut out of the gel and the DNA 

10 isolated. A Xhol/PstI mini library of Flavobacterium sp. R1534 genomic DNA was constructed into Xhol - PstI sites of 
pBluescriptllSK{+). One hundred E coli XL1 transfbrmants were subsequentely screened by PGR with primer #7 and 
primer # 8. the same primers previously used to obtain the 119 bp fragment (46F). One positive transfbrmant, named 
done 85, was found. Sequencing of the insert revealed sequences not only homologous to the phytoene synthase 
(crtB) but also to the phytoene desaturase (crti) of both Erwinia species herbioola and uredovora Left and right hand 

75 genomic sequences of done 85 were obtained by the same approach using probe A and probe B. Flavobacterium sp. 
R1534 genomic DNA was double digested with Gal and Hind 111 and subjected to Southern analysis with probe A and 
probe B. With probe A a Clal/Hindllt fragment of aprox. 1.8 kb was identified (Fig. 3A), isolated and subcloned into the 
Ctal/Hindlll sites of pBtuescriptllKS [-¥). Screening of the E. coli XL1 transfornnants with probe A gave 6 positive clones. 
The insert of one of these positives, done 43-3, was sequenced and showed homology to the N-terminus of crtI genes 

20 and to the C-terminus of crtY genes of both Erwinia species mentioned above. WtVn probe B an approx. 9.2 kb Clat/Hin- 
dlll fragment was detected (Rg. 3B). isolated and sutxkined into pBluescriptllKS (-i-). 

A screening of the transformants gave one positive, done 51. Sequencing of the 5' and 3* of the insert, revealed 
that only the region close to the Hindlll site showed relevant homology to genes of the carotenoid bk)synthesis of the 
Erwinia species mentioned above (e.g. alB gene and crtE gene). The sequence arourxi the Clal site showed no homol- 

S5 ogy to known genes of the carotenoid biosynthesis pathway. Based on this information and to facilitate further sequenc- 
ing and construction work, the 4.2 kb BamHI/Hindlll fragment of done 51 was subdoned into the respective sites of 
pBluescriptllKS(-f) resulting in done 2. Sequendng of the insert of this clone confirmed the presence of genes homol- 
ogous to EnM'nia sp. crtB and crtE genes. These genes were located within 1 .8 1^ from the Hindlll site. The remaining 
2.4 kb of this insert had no homology to known carotenoid biosynthesis genes. 

30 Additional genomic sequences downstream of the Clal site were detected using probe C to hybridize to Flavobac- 
terium sp. R1 534 genomic DNA digested with different restriction enzymes (see figure 4). 

A Sall/Hindlll fragment of 2.8 kb identified by Southern analysis was isolated and sut>doned into the Hindi il/Xhol 
sites of pBluescriptllKS (+). Screening of the E. coli XL1 transformants with probe A gave one positive clone named 
clone 59. The insert of this clone confirmed the sequerKse of done 43-3 and contained in addition sequences homolo- 

35 gous to the N-terminus of the crtY gene from other known lycopene cydases. To obtain the putative missing crtZ gene 
a Sau3AI partial digestion library of Flavobacterium sp. R1534 was constructed into the BamHI site of pBluescriptl- 
IKS(+). Screening of this library with probe D gave several positive clones. One transfomnant designated, done 6a, had 
an insert of 4.9 kb. Sequendng of the insert revealed besides the already known sequences coding for crtB, crtl and 
crtY also the missing crtZ gene. Clone 7g was isolated from a mini library carrying Bctt/Sphl fragments of R1534 (Rg. 

40 5) and screened with probe D. The insert size of done 7g is approx. 3 kb. 

The six independent inserts of the clones described above covering approx. 1 4 kb of the Flavobacterium sp. R1 534 
genome are compiled in Figure 6. 

The determined sequence spanning from the BamHI site (position 1) to base pair 8625 is shown figure 7. 

45 Putative protein coding regions of the cloned R1 534 sequence. 

Computer analysis using the CodonPreference program of the GCQ package, which recognizes protein coding 
regions by virtue of the similarity of their codon usage to a given codon frequency table, revealed eight open reading 
frames (ORFs) encoding putative proteins: a partial ORF from 1 to 1 165 (ORF-5) coding for a polypeptide larger than 

50 41382 Da; an ORF coding for a pdypeptide with a molecular weight of 40081 Da from 1 180 to 2352 (ORF-1); an ORF 
coding for a polypeptide with a molecular weight of 31331 Da from 2521 to 3405 (crtE); an ORF coding for a polypeptide 
with a molecular weight of 32615 Da from 431 6 to 3408 (crtB); an ORF coding for a pdypeptide with a molecular weight 
of 5441 1 Da from 5797 to 4316 (crtl); an ORF coding for a pdypeptide witii a nrv3lecular weight of 42368 Da from 6942 
to 5797 (crtY); an ORF coding for a polypeptide with a molecular weight of 1 9282 Da from 7448 to 6942 (crtZ); arxJ an 

55 ORF coding for a polypeptide with a molecular weight of 1 9368 Da from 831 5 to 7770 (ORF-i 6); pRF-.l and crtE have 
the opposite transcriptional orientation from the otheris (Rg. 6). Tbe translation start sites of the ORFs crtl, crtY and crtZ 
could clearly be determined based on the appropiately located sequences homologous to the Shine/Delgano (S/D) 
[Shine and Dalgarno, Proc. Nati. Acad. Sci. USA 21 1342-1346 (1974)] consensus sequence AGG-6-9N--ATG (Rg. 
10) and the homology t tiie N-terminal sequences of the respective enzymes f E. herbicola and E uredovora. The 
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translation of the ORF crtB could potentially start from three closely spaced codons ATG (4316), ATG (4241) and ATG 
(421 1). The first one. although not having the best S/D sequence of the three, gives a translation product with the high- 
est homology t the N-terminus of the E. herbicola and E. uredovora crtB protein, and is therefore the most likely trans- 
lation start sita The translation of ORF crtE could potentially start from five different start codons found within 150 bp : 
5 ATG (2389). ATG (2446), ATG (2473), ATG (2497) and ATG (2521). We believe that based on the following observa- 
tions, the ATG (2521 ) is the most likely transcription start site of crtE: this ATG start codon is proceeded by the best con- 
sensus S/D sequence of all five putative start sites mentioned; and the putative N-terminal amino acid sequence of the 
protein encoded has the highest homology to the N-terminus of the crtE enzymes of E. herbicola and E. uredovora\ 



10 Characteristics of the art translational initiation sites and gene products. 



The translational start sites of the five carotenoid biosynthesis genes are shown below and the possible ribosome 
binding sites are underlined. The genes otZ, crtY, crti and crtB are grouped so tightiy that the TGA stop codon of the 
anterior gene overlaps the ATG of the following gene. Only three of tiie five genes (crtl, crtY and crtZ) fit with the con- 
15 sensus for optimal S/D sequences. The txixed TGA sequence shows the stop condon of the anterior gena 



-10 +1 



20 


ACG AaSQC ACCGATG ACGCCCA 


crtE 




CGGACCTGGC CXST CGC^^CCGATC 


crtB 


25 


CGGATCGC AA TAG i^SfilGCC ATG 


crtY 




CTGCAGGAGAGAGCi^gG^GTTCCG 


crtl 


30 


GCAASSGGCCGGCATGAGC AC TT 


crtZ 



35 

Amino acid sequences of individual crt genes of Flavobacterlum sp. R1534. 

All five ORFs of Flavobacterium sp. R1534 having homology to known carotenoid biosynthesis genes of other spe- 
cies are clustered in approx. 5.2 kb of the sequence (Fig. 7). 

40 

GGDP synthasB (crtE) 

The amino add (aa) sequence of the geranylgeranyt pyrophosphate synthase (crtE gene product) consists of 295 
aa and is shown In figure 8. This enzyme condenses farnesyl pyrophosphate and isopentenyl pyrophosphate in a 1 ' - 4. 

45 

Phytoene synthase (crtB) 

This enzyme catalyzes two enzymatic steps. Rrst it condenses in a head to head reaction two geranytgeranyl pyrophos- 
phates (C20) to the 040 carotenoid prephytoene. Second it rearanges the cyciopropylring of prephytoene to phytoene. 
so The 303 aa encoded by the crtB gene of Flavobacterium sp. R1 534 is shown in figure 9. 

Phytoene desaturase (crt!) 

The phytoene desaturase of Flavobacterium sp. R1534 consisting of 494 aa, shown in figure 10. performs like the 
55 crtl enzyme of E. herbicola and E. uredovora, four desaturation steps, converting the non-coloured carotenoid phy- 
toene to the red coloured lycopene. 
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Lycopene cyclase (crtY) 

The crtY gene product of Flavobacterium sp. R1534 is sufficient to introduce the p-ionone rings at both sides of 
lycopene to obtain p-carotene. The lycopene cyclase of navobacterium sp. R1534 consists of 382 aa (Fig. 1 1). 

5 

p-carotene hydroxylase (crtZ) 

The gene product of crtZ consisting of 1 69 aa (Rg. 12) and hydroxylates p-carotene to the xanthophyll zeaxanthin. 

10 Putative enzymatic functions of the ORFs (orff-1 , orf-5 and orf-1 6) 

The orf-1 has at the aa level over 40% identity to acetoacetyl-CoA thiolases of different organisms (e.g. Candida 
tropicalis. human, rat). This gene is therefore most likely a putative acetoacetyl-CoA thiolase (acetyl-CoA acetyttrans- 
ferase), which condenses two molecules of acetyl-CoA to Acetoacetyt-CoA. Condensation of acetoacetyl-CoA with a 

IS third acetyl-CoA by the HMQ-CoA synthase forms p-hydroxy-p-methylglutaryl-CoA (HMQ-CoA). This compound is part 
of the mevalonate pathway which produces besides sterols also numerous kinds of isoprenoids with diverse cellular 
functions. In t)acteria and plants, the isoprenoid pathway is also able to synthesize some unique products like caroten- 
oids. growth regulators (e.g. in plants gibberellins and abcissic acid) and sencodary metabolites like phytoalexins [Riou 
et al.. Gene 143, 293-297 (1994)]. 

20 The orf-5 has a low honnology of approx. 30%. to the anrdno acid sequence of polyketide synthases from different 
streptomyces (e.g. S. violaceoruber. S. dnnamonensis). These antibiotic synthesizing enzymes (polyketide synthases), 
have been classified into two groups. Type-I polyketide synthases are large nrujltifunctional proteins, whereas type-ll 
polyketide synthases are muftiprotein complexes conposed of several individual proteins involved in the subreactions 
of the polyketide synthesis pbb. et al. Gene ±^ 31-39 (1994)]. 

25 The putative protein encoded by the Off-16 has at the aa level an identity of 42% when compared to the solutsle 
hydroger^se sutxjnit of Anabaena cylindrica. 

Functional assignment of the ORF 's (crtE, crtB, crti, crtY and crtZ) to enzymatic activities of the carotenold 
biosynthesis pathway. 

30 

The biochemical assignment of the gene products of the different ORPs were revealed by analyzing carotenoid 
accumulation in E. colt host strains that were transformed with deleted variants of the Flavobacterium sp. gene cluster 
and thus expressed not all of the crt genes (Rg. 13). . ■ . 

Three different plasmid were constructed: pLyco, p59-2 and pZea4. Plasmid p59-2 was obtained by subcloning the Hin^ 
35 dlll/BamHI fragment of clone 2 into the Hindlll/BamHI sites of done 59. p59-2 carries the ORF's of the crtE. crtB, crtI 

and crtY gene and should lead to the production of p-carotene. pLyco was obtained by deleting the Kpnl/Kpnl fragment. 

coding for approx, one half (N-terminus) of the crtY gene, from the p59-2 plasmid. E. coli cells transformed with pLyco. 

and therefore having a truncated non-functional crtY gene, should produce lycopene. the precursor of p-carotene. 

pZea4 was constructed by ligation of the Ascl-Spel fragment of p59-2, containing the crtE, crtB. crtI and most of the 
40 crtY gene with the Ascl/Xbal fragment of clone 6a. containing the sequences to complete the crtY gene and the crtZ 

gene. pZea4 [for complete sequence see Fig. 24; nucleotides 1 to 683 result from pBluescriptllKS(+), nucleotides 684 

to 8961 from Flavobacterium R1534 WT genome, nucleotides 8962 to 1 1233 from pBluescriptllKS(+}] has therefore all 

five ORPs of the zeaxanthin biosynthesis pathway. Plasmid pZea4 has been deposited on May 25, 1995 at the DSM- 

Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (Germany) under accession No. DSM 10012. E. 
45 coli cells transformed witii this latter plasmid should therefore produce zeaxanthin. For the detection of tiie carotenoid 

produced, transfbrmants were grown for 48 hours in shake flasks and then subjected to carotenoid analysis as 

described in the methods section. Figure 13 summarizes the different inserts of the plasmids described atx)ve. and the 

main carotenoid detected in the cells. 

As expected the pLyco carrying E. coli cells produced lycopene, those carrying p59-2 produced p-carotene (all- 
50 E,9-Z,1 3-Z) and the cells having the pZea4 construct produced zeaxanthin. TNs confirms that all the necessary genes 

of Flavobacterium sp. R1534 for the synthesis of zeaxanthin or their precursors (phytoene, lycopene and p-carotene) 

were doned. 

Example 3 

55 

IViaterials and methods used for expression of carotenoid synthesizing enzymes 

Bacterial strains and plasmids: The vectors pBluescriptllKS (-«•) or (-) (Sb-atagene. La Jolla. USA) and pUC18 
[Vieira and Messing, Gene la. 259-268 (1982); Norrander et al.. Gene gfi, 101-106 (1983)] were used for cloning in dif- 



11 



EP0 747 483 A2 



ferent E. co// strains, like XL-1 blue (Stratagene). TGI or JM109. In af) a subtil^ transfbnnations, strain 1012 was used. 
Plasmids pHP13 [Haima et al.. Mol. Gen. Genet goa. 335-342 (1987)] and p602/22 [LeGrice. S.F.J. in Gene Expres- 
sion Technology. Goeddel. O.V. Editor, 201-214 (1990)] are Gram (+)/(-) shuttle vectors able to replicate in B. subtilis 
and £. colt cells. Plasnnid p205 contains the vegl promoter doned into the Smal site of pUC1 8. Plasmid pXII 2 is an inte- 

5 graton vector for the constitutive expression of genes in S. subWis [Haiker et al., in 7th Int. Symposium on the Genetics 
of Industrial Microorganisms, June 26^uly 1, 1994. Mongreal, Quebec, Canada (1994)]. Plasmid pBESTSOl [Itaya et 
al., Nucleic Adds Res. IZ (1 1). 4410 (1989)] contains the neomycin resistance gene cassette originating from the plas- 
rrtd pUB1 10 (GenBank entry: M19465) of S. aureus [McKenzie et al.. Plasmid 15, 93-103 (1986); McKenzie et al., Plas- 
mid 12. 83-84 (1987)]. This neomydn gene has been shown to work as a selection marter when present in a single 

10 copy in the genome of 0. subtilis. Plasmid pC194 (ATCC 37034)(GenBank entry: L08860) originates from S. aureus 
[Horinouchi and WeisUaum. J. Bacteriol. 150. 815-825 (1982)] and contains the chloramphenicol acetyttransferase 
gene. 

Media and growth conditions: E. oo// were grown in Luria broth (LB) at 37® C with lOO^g Ampicillin (Amp)/m! for 
selection. B. subtilis cells were grown in VY-medium supplemented with either erythromycin (1 Mg/ml). neomycin (5-180 

15 ^ig/ml) or chloranphenicol (10-80 jig/ml). 

Transformation: E. coH transformations were done by electroporation using the Gen-pulser device of BIO- RAD 
(Hercules. CA, USA) with the following parameters (200 SX 250 ^FD, 2.5V). B. subtiiis transformations were done basi- 
cally according to the standard procedure method 2.8 described by [Cutting and Vander Horn in Molecular Biological 
Methods for Baallus. Hanwood. C.R and Cutting. S.M., Editor, John Wiley & Sons: Chichester, England. 61-74 (1990)]. 

20 Colony screening: Bacterial colony screening was done as described by [Zon et al., s.a.]. 

Oligonucleotide synthesis: The oligonucleotides used for PCR reactions or for sequencing were synthesized with 
an Applied Biosystems 392 DNA synthesizer. 

PCR reactions: The PCR reactions were performed using either the UITma DNA polymerase (PerWn Elmer 
Cetus) or the Pf u Vent polymerase (New Englarxi Biolabs) according to the manufacturers instructions. A typical 50 ^1 

25 PCR reaction contained: lOOng of template DNA, 10 pM of each of the primers, all four dNTP's (final cone. 300 ^M), 
MgClg (when UITma polymerase was used; final cone. 2 mM), 1x UITma reaction buffer or 1x Ru buffer (supplied 
the manufacturer). All corrponents of the reaction with the exception of the DNA polymerase were incubated at 95°C 
for 2 min. followed by the cycles indicated in the respective section (see below). In ail reactions a hot start was made, 
by adding the polymerase in the first round of the cycle during the 72°C elongation step. At the end of the PCR reaction 

30 an aliquot was analysed on 1% agarose gel. before extracting once with phenol/chloroform. The amplified fragment in 
the aqueous phase was predpitated with 1/10 of a 3M NaAcetate solution and two volumes of Ethanol. After centrifu- 
gation for 5 min. at 1 2000 rpm. the pellet was resusperKled in an adequate volume of 1^20. typically 40 p.1. before dig^- 
tion with the indicated restriction enzymes was performed. Alter the digestion the mixture was separated on a 1% low 
melting point agarose. The PCR product of the expected size were excised from the agarose and purified using the 

35 glass beads method (GENECLEAN KIT. Bio 101 . Vista CA, USA) when the fragments were above 400 bp or directly 
spun out of the gel when the fragments were shorter than 400 bp. as descrit)ed by [Heery et al., TIBS (6), 1 73 (1 990)]. 

Ollgos used for gene amplification and site directed mutagenesis: 

40 All PCR reactions performed to allow the construction of the different plasntids are described below. All the primers 
used are summarized in figure 14. 

Primers #100 and #101 were used in a PCR reaction to anplify the complete crtE gene having a Spel restriction 
site and an artificial ribosomal binding site (RBS) upstream of the transcription start site of this gene. At the 3* end of 
the amplified fragment, two unique restriction sites were introduced, an Avrll and a Smal site, to facilitate the further 

45 cloning steps. The PCR reaction was done with UITma polymerase using the following conditions for the amplification: 
5 cycles with the profile: 95°C. 1 min7 60°C. 45 sec7 72°C. 1 min. and 20 cycles with the profile: 95°C. 1 min7 72°C, 1 
min.. Plasmid pBllKS(4-)-clone2 served as template DNA. The final PCR product was digested with Spel and Smal and 
Isolated using the GENECLEAN KIT. The size of the fragment was approx. 91 0 bp. 

Primers #104 and #105 were used in a PCR reaction to amplify the crtZ gene from the translation start till the Sail 

50 restriction site, located in the coding sequence of this gena At the 5' end of the crtZ gene an EcoRI, a synthetic RBS 
and a Ndel site was introduced. The PCR conditions were as described above. Plasmid pBllKS(+)-clone 6a served as 
template DNA and the final PCR product was digested with EcoRt and Sail. Isolation of the fragment of approx. 480 bp 
was done witii the GENECLEAN KIT. 

Primers l\AUT1 and MUT5 were used to arr^lify the complete crtY gene. At the 5* end. the last 23 nucleotides of the 

55 crtZ gene including the Sail site are present, followed by an artificial RBS preceding the translation start site of the crtY 
gena The artificial RBS created includes a Pmll restriction site. The 3' end of the amplified fragment contains 22 nucle- 
otides of the crtl gene, preceded by an newly created artif ial RBS which contains a Muni restriction site. The conditions 
used for the PCR reaction were as described above using the following cycling profOe: 5 rounds of 95^C, 45 8ec./60°C. 
45 sec./ 72°C, 75 sec. followed by 22 cycles with the profile: 95°C, 45 sec7 66*'C, 45 secJ 72°C, 75 sec.. Plasmid 
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pXI12-2YIB-EINV4 served as template for the Pfu Vent polymerase. The PGR product of 1225 bp was made blunt and 
cloned Into the Smal site of pUC18, using the Sure-Clone Kit (Pharmacia) according to the manufacturer. 

Primers MUT2 and MUT6 were used to amplify the complete crti gene. At th 5* the last 23 nucleotides of the crtY 
gene are present, followed by an artifidat RBS which precedes the translation start site of the crtI gene. The new BBS 

5 created, includes a Munt restriction site. The 3' end of the amplified fragment contains the artificial RBS upstream of the 
crtB gene including a BamHI restriction site. The conditions used for the PGR reaction were basically as described 
above including the following cycling profile: 5 rounds of 95°C, 30 sec7 60*^. 30 sea/ 72^G, 75 sec., followed by 25 
cycles with the profile: 95°C. 30 sec./66**G. 30 sec7 72°G. 75 sec. Plasmid pXI12-2YIB-EINV4 served as template for 
the Pfu Vent polymerase. For the further cloning steps the PGR product of 1541 bp was digested with Muni and BamHI. 

TO Primers MUT3 and CAR1 7 were used to amplify the N-terminus of the crtB gene. At the 5* the last 28 nucleotides 
of the crtI gene are present followed by an artificial RBS. preceding tiie translation start site of the crtB gene. This new 
created RBS. includes a BamHI restriction site. The anplified fragment named PGR-F contains also the Hind I II restric- 
tion site located at the N-terminus of the crtB gene. The conditions used for ttie PGR reaction were as desaibed else- 
where in the text, including the following cycling profile: 5 rounds of 95°G. 30 sec7 58°G. 30 sec./ 72**G, 20 sec. followed 

IS by 25 cycles with the profile: 95°C. 30 secY 60°G. 30 sec7 72°a 20 sec.. Plasmid pXI12-ZYIB-ElNV4 sen/ed as tem- 
plate for the Pfu Vent polymerase. The PGR product of approx. 160 bp was digested with BamHI and Hindi 1 1. 

OUgos used to amplify the chloramphenicol resistance gene (cat): 

20 Primers CAT3 and GAT4 were used to amplify tiie chloramphenicol resistance gene of pGI 94 (ATGG 37034) [Hori- 
nouchi and Weisblum. 8.a.] a R-plasmtd found in S. aureus. The conditions used for the PGR reaction were as 
desaibed previously including the following cycling profile: 5 rounds of 95^G. 60 secV 50°G. 60 sea/ TZ^C, 2 min. fol- 
lowed by 20 cycles with the profile: 95®G. 60 sec7 60**G. 60 sec7 72°G. 2 min.. Plasmid pG198 served as template for 
the Pfu Vent polymerase. The PGR product of approx. 1050 was digested with EcoRI and Aatll. 

25 Ollgos used to generate linkers: Linkers were obtained by adding 90 ng of each of the two corresponding primers 
into an Eppendorf tube. The mixture was dried in a speed vac and the pellet resuspended in 1x Ligation buffer (Boe- 
hringer. Mannheim. Qermany). The solution was incubated at 5Q°C for 3 min. before cooling down to RT. to allow the 
primers to hybridize properly. The linker were now ready to be ligated into the appropriate sites. All the oligos used to 
generate linkers are shown in figure 15. 

30 Primers CS1 and CS2 were used to form a linker containing the following restrictions sites Hind I II. Aflll, Seal, Xbal. 
Pmel and EcoRI. 

Primers MUT7 and MUT8 were used to form a linker containing ttie restriction sites Sail. Avrll. Pmll, MIuI. Muni, 
BamHI. Sphi and Hindlll. 

Primers MUT9 and MUT10 were used to introduce an artificial RBS upstream of crtY. 
35 Primers MUT11 and MUT12 were used to introduce an artificial RBS upstream of crtE. 

Isolation of RNA: Total RNA was prepared from log phase growing B. subti/is according to the method described 
by [Maes and Messens. Nucleic Acids Res. gQ (16). 4374 (1992)]. 

Northern Blot analysis: For hyt>ridization experiments up to 30 ^g of S. subtilis RNA was electrophoreses on a 
1% agarose gel made up in 1x MOPS and 0.66 M formaldehyde. Transfer to Zeta-Probe blotting membranes (BIO- 
40 RAD). UV cross-linking, pre-hytxidization and hybridization was done as described elsewhere in [Farrell, J.R.E.. RNA 
Methodologies. A laboratory Guide for isolation and characterization. San Diego, USA: Academic Press (1993)]. The 
washing conditions used were: 2 x 20 min. in 2xSSPE/0.1% SDS followed by 1 x 20m)n. in 0.1% SSPEA).1% SDS at 
65°G. Northern t)lots were then analyzed either by a Phosphorimager (Molecular Dynamics) or by autoradiography on 
X-ray films from Kodak. 

45 Isolation of genomic DNA: 6. subtilis genomic DNA was isolated from 25 ml overnight cultures according to the 
standard procedure method 2.6 described by [1 3]. 

Southern blot analysis: For hybridization experiments 8. subtilis genomic DNA (3 |xg) was digested witii tiie 
appropriate restriction enzymes arxd electrophoresed on a 0.75% agarose gel. The transfer to Zeta-Probe blotting 
membranes (BIO-RAD). was done as described [Southern. E.M., s.a.]. Prehybridization arKi hybridization was in 

50 7%SDS. 1% BSA (fraction V; Boehringer). 0.5M Na2HP04. pH 7.2 at 65^G. Alter hybridization the membranes were 
washed twice for 5 min. in 2x SSC, 1% SDS at room terrperature and twice for 15 min. in 0.1% SSC, 0.1% SDS at 65° 
G- Southern blots were then analyzed either by a Phosphorimager (Molecular Dynamics) or by autoradiography on X- 
ray films from Kodak. 

DNA sequence analysis: The sequence was determined by the dideoxy chain termination technique [Sanger et 
55 al., s.a.] using the Sequenase Kit Version 1.0 (United States Biochemical). Sequence analysis were done using the 
GGG sequence analysis software package (Version 8.0) k>y Genetics Gomputer, Inc. [De/ereux et al.. s.a.]. 

Gene amplification in B. subtWs: To amplify the copy number of the SFGO in B. subtiiis transformants. a single 
colony was inoculated in 15 ml VY-medium supplemented with 1 .5 % glucose and 0.02 mg chloramphenicol or neomy- 
cin/ml, dependend n tiie antibiotic resistance gene present in the amplrfiable structure (see results and discussion). 
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The next day 750 fil of this cufture were used to inoculate 13 ml VY-medium containing 1.5% glucose supplemented 
with (60, 80. 120 and 150 |ig/nnl) for the cat resistant nuitants. or 160 jig/ml and 180 \iqM for the neomycin resistant 
mutants). The cultures were grown overnight and th next day 50 jiJ of different dilutions (1 : 20. 1 :400, 1 : 8000, 1 : 
160*000) were plated on VY agar plates with the appropriate antibiotic concentration. Large angle colonies were then 
further anafyzed to determine the number of copies and the amount of carotenoids produced. 

Analysis of carotenoids: E. cofi or B. subtilis transformants {200 - 400 nrd) were grown for the times indicated in 
the text, usually 24 to 72 hours, in LB-medium or VY-medium, respectively, supplemented with anttoiotics, in shake 
flasks at 37** C and 220 rpm. 

The carot^oids produced by the microorganisms were extracted with an adequate volume of acetone using a rota- 
tion homogenizer (Polytron, Wnematica AG, CH-Luzem). The homogenate was the filtered through the sintered glass 
of a suction fitter into a round bottom flask. The filtrate was evaporated by means of a rotation evaporator at 50^ C using 
a water-jet vacuum. For the zeaxanthin detection the residue was dissolved in n-hexane/acetone (86:14) before analy- 
sis with a normalphase HPLC as descn'bed in [Weber. S., s.a.]. For the detection of p-carotene and lycopene the evap- 
orated extract was dissolved in n-hexaneAacetone (99:1) and analysed by HPLC as described in Hengartner etal.. s.a.]. 

Example 4 

Carotenofd production In £■ co^ 

The biochemical assignment of the gene products of the cfifferent open reacfing frames (ORPS) of the carotenoid 
biosynthesis cluster of Flavobacterium sp. were revealed by analyzing the carotenoid accumulation in £. coti host 
strains, transfbnned with plasmids carrying deletions of the F/avobacterhjm sp. gene cluster, and thus lacking some of 
the crt gene products. Similar functional assays in E. coli have been described by other authors [Misawa et al.. s.a.; 
Perry et al., J. Bacteriol.. Ifig. 607-612 (1986); Hundle. et al„ Molecular and General Genetics 254 (4). 406-416 (1994)]. 
Three different plasmid pLyca pBIIKS(+)-clone59-2 and p2ea4 were constructed from tiie three genomic isolates pBI- 
IKS('i-)-done2, pBIIKS(+)-clone59 and pBnKS(+)-clone6a (see figure 16). 

Plasmid pBIIKS(+)-clone59-2 was obtained by subdoning the Hindlll/BamHI fragment of pBIIKS(+)-done 2 into the 
Hindltl/BamHI sites of pBIIKS(-f)-clone59. The resulting plasmid pBIIKS(4-)-clone59-2 carries tiie complete ORPs of 
the crtE, crtB. crti and atY gene and shoukJ lead to the production of p-carotene. pLyco was obtained by deleting the 
KpnI/Kpnl fragment, coding for approx. one half (N-terminus) of the crtY gene, from the plasmid pBIIKS(+)-clone59-2. 
E. coli cells transformed witii pLyco, and therefore having a truncated non-functional crtY gene, should produce lyco- 
pene, ttie precursor of p-carotene. pZea4 was constructed by ligation of ttie Ascl-Spel fragment of pBIIKS(+)-clone59- 
2, containing the crtE, crtB, crtI and most of the crtY gene with the Ascl/Xbal fragment of clone 6a. containing the crtZ 
gene and sequences to complete the tioincated crtY gene mentioned above. pZea4 has therefore all five ORPs of the 
zeaxanthin biosynthesis pathway E. coti cells transformed with this latter plasmid should ttierefbre produce zeaxanthin. 
For the detection of the carotenoid produced, transformants were grown for 43 hours in shake flasks and then subjected 
to carotenoid analysis as described in the methods section. Rgure 16 summarizes the construction of the plasmids 
described above. 

As expected the pLyco carrying E. coli cells produced lycopene. those carrying pBllKS(+)-clonB59-2 produced p- 
carotene (all-E.9-Z,13-^ and tiie cells having the pZea4 construct produced zeaxarthin. This confirnw that we have 
cloned all the necessary genes of Flavobacterium sp. R1534 for tfie synthesis of zeaxanthin or their precursors (phy- 
toene. lycopene and p-carotene). The production levels obtained are shown in table 1. 
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plasmid 


host 


zeaxanthin 


P-carotene 


lycopene 


pLyco 


E. colt JM109 


ND . 


ND 


0.05% 


pBIIKS(+)-clone59-2 


m 


ND 


0.03% 


ND 


pZea4 


N 


0.033% 


0.0009% 


ND 



Table 1: Carotenoid content of E. coli transformants, carrying the plasmids 
pLyco, pBIIKS(+)-done59-2 and pZea4, after 43 hours of culture 
in shake flasks. The values indicated show the carotenoid content 
in % of the total dry cell mass (200 ml). ND = not detectable- 



Carotenoid production In g, subtiUs 

In a first approach to produce carotenoids in 6. subtilus, we cloned the carotenoid biosynthesis genes of 
Flavobacterium into the Gram (+)/(-) shuttle vectors p602/22. a derivative of p602/20 [LeGrice, S.RJ.. s.a.]. The assem- 
bling of the final construct p602-CARVEG-E. begins with a triple ligation of fragments Pvull-Avrll of pZea4(deI654- 
3028) and tiie Avrll-EcoRI fragment from plasmid pBIIKS(+]-clone6a, into the EcoRI and Seal sites of the vector 
p602/22. The plasmid pZea4(del654-3028} had been obtained by digesting pZea4 with Sad and Espl. The protruding 
and recessed ends were made blunt with Ktenow enzyme and religated. Construct pZea4(del554-3028) lacks most of 
the sequence upstream of crtE gene, which are not needed for the carotenoid biosynthesis. The plasmid p602-CAR has 
apprax. 6.7 kb of genomic Flavobacterium R1534 DNA containing besides all five carotenoid genes (approx. 4.9 kb), 
additional genomic DNA of 1.2 kb, located upstream of the crtZ translation start site and further 200 bp, located 
upstream of crtE transcription start. TTie crtZ. crtY, crti and crtB genes were cloned downstream of the Pr42S/o pronooter, 
a regulatable E. coli bacteriophage T5 promoter derivative, fused to a lac operator element, which is functional in 8. 
subtilis [LeGrice. S.F.J.. s.a.]. It is obvious that in ti^e p602CAR construct, the distance of over 1200 bp between the 
P|si25/o promoter and the transcription start site of crtZ is not optima) and will be irrproved at a later stage. An outiine of 
the p602CAR construction is shewn in figure 17. To ensure transcription of the crtE gene in B. subtilis, the vegl pro- 
moter [Moran et al.. Mol. Gen. Genet 186. 339-346 (1982); LeGrice et al., Mol. Gen. Genet 204. 229-236 (1986)] was 
introduced upstream of this gene, resulting in the plasmid construct p602-CARVEG-E. The vegl promoter, which origi- 
nates from site I of the veg promoter complex described by [LeGrice et al., s.a] has been shown to k>e functional in E. 
coli [Moran et al., s.a.]. To obtain tiiis new construct, the plasmid p602CAR was digested with Sail and HindlU, and the 
fragment containing the complete crtE gene and most of the crtB coding sequence, was subcloned into the Xhol and 
Hindi 1 1 sites of plasmid p205. The resulting plasmid p205CAR contains the crtE gene just downstream of the Pvegl pro- 
moter. To reconstitute the carotenoid gene cluster of Flavobacterium sp. the following three pieces were isolated: 
Pmel/Hindlll fragment of p205CAR, the Hincll/Xbal fragment and the EcoRI/Hlndlll fragment of p602CAR and ligated 
into tiie EcoRl and Xbal sites of pBlue8criptllKS(-^), resulting in the constiruct pBIIKS('i-)-CAR\/EG-E. Isolation of the 
EcoRl-Xbal fragment of this latter plasmid and ligation into the EcoRI and Xbal sites of p602/22 gives a plasmid similar 
to p602CAR but having tiie crtE gene driven by the Pvegl promoter. All the construction steps to get the plasmid 
p602CARVEG-E are outiined in figure 18. E. coli TGI cells transformed with this plasmid synthesized zeaxanthin. In 
contrast B. subtilis sbain 1012 ti'ansfbrmed with the same constructs did not produce any carotenoids. Analysis of sev- 
eral zeaxanthin negative B. subtilis transfbrmants always revealed, that the transformed plasmids had undergone 
severe deletions. This instability coukd be due to the large size of the constructs. 

In order to obtain a stable construct in B. subtilis, the carotenoid genes wer cloned into the Gram (+)/(-) shuttie 
vector pHP13 constiucted by [Haima et al., s.a.]. The stability problems were thought to be omitted by 1) reducing the 
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size of the cloned insert which cames the carotenoid genes and 2) reversing the orientation of the crtE gene and thus 
only requiring one promoter for the expression of all five genes» instead of two, like in the previous constructs. Further- 
nK>re, the ability of cells transformed by such a plasmid carrying the synthetic Flavobacterium carotenoid operon 
(SFCO), to produce carotenoids, would answer the question if a modular approach is feasible. Rgure 19 summarizes 

5 all the construction steps and intermediate plasmids made to get the final construct pHP13-2PNZYIB-EINV. Briefly: To 
facilitate the following constructions, a vector pHP13-2 was made, by introducing a synthetic linker obtained with primer 
CS1 and CS2. between the Hindlll and EcoRI sites of the shuttle vector pHP13. The intermediate construct pHP13- 
2CARVEG-E was constructed by subdoning the Aflll-Xbal fragment of p€02CARVEG-E into the Aflll and Xbal sites of 
pHP13-2. The next step consisted in the inversion of crtE gene, by removing Xbal and Avrll fragment containing the 

10 original oiE gene and replacing it with the Xbal-Avrll fragment of plasmid pBIIKS(-i-)-PCRRBScrtE. The resulting plas- 
mid was named pHP13-2CARZYIB-EINV and represented the first construction wHh a functional SFCO. The interme- 
diate construct pBIIKS(+)-PCRRBScrtE mentioned above, was obtained by digesting the PGR product generated with 
primers #100 and #101 with Spel and Smal and ligating into the Spel and Smal sites of pBluescriptllKS(+). In order to 
get the crtZ transcription start dose to the promoter Pm2s/o a triple ligation was done with the BamHI-Sall fragment of 

IS pHPI 3-2CAR2YIB-EINV (contains four of the five carotenoid genes), the BamHI-EcoRI fragment of the same plasmid 
containing the PN25rt) promoter arxl the EcoRI-Sall fragment of pBllKS(+)-PCRRBScrtZ. having most of the crtZ gene 
preceded by a synthetic RBS. The aforementioned plasmid pBIISK(+)-PCRRBScrtZ was obtained by digesting the 
PGR product amplified with primers #104 and #105 with EcoRI and Sail and ligating into the EcoRI and Sail sites of 
pBluescriptllSK(-i-}. In the resulting vector pHP13-2PN252rYIB-ElIsiV, the SFCO is driven by the bacteriophage T5 pro- 

20 moter Pn2sa), which should be constitutively expressed, due to tiie absence of a functional lac repressor in the construct 
[Peschke and Beuk. J. Mol. Biol. 186. 547-555 (1 985)]. E. cofi TGI cells transformed with this construct produced zeax- 
anthin. Nevertheless, when Ms plasmid was transformed into B. subtilis, no carotenoid production could be detected. 
Analysis of the plasmicte of these transformants showed severe deletions, pointing towards instability problems, similar 
to the observations made with the aforementioned plasmids. 

25 

Examples 6 

Chromosome Integration Consirucis 

30 Due to tfie instability observed with the previous constructs we decided to integrate the carotenoid biosynthesis genes 
of Flavobacterium sp, into the genome of 3. subtHis using the integration/expression vector pXI12. This vector allows 
the constitutive expression of whole operons after integration into the levan-sucrase gene (sacB) of the B. subtilts 
genome. The constitutive expression is driv«i by the vegl promoter and results In medium level expression. The plas- 
mid pXI12-2YIB-EINV4 containing the synthetic Flavobacterium carotenoid operon (SFCO) was constructed as fbl- 

35 lows: the Ndel-Hincll fragment of pBIISK(+)-PCRRBScrtZ was doned into tiie Ndel and Smal sites of pXI12 and the 
resulting plasnrtid was named pXI12-PCRcrtZ. In the next step, the BstEII-Pmel fragment of pHP13-2PN25ZYIB-ElNV 
was ligated to the BstEll-Pmel fragment of pXI12-PCRcrtZ (see figure 20). a subtilts transformed with the resulting 
construct pXI12-ZYIB-EINV4 can integrate the CAR genes either via a Campbell type reaction or via a reciprocal 
reconrtoination. One transformant, BS1012:2YIB-EINV4. having a redprocal recombination of the carotenoid biosyn- 

40 thesis genes into the levan-sucrase gene was further analyzed (figure 21). Although this strain did not synthesize car- 
otenoids, RNA analy^s by Northern blots showed the presence of specific polycistronic mRNA's of 5.4 kb and 4.2 kb 
when hybridized to probe A (see figure 21, panel B). Whereas the larger mRNA has the expected message size, the 
origin of the shorter mRNA was unclear. Hybridization of the same Northern t^lot to probe B only detected the large 
mRNA fragment, pointing towards a premature termination of the transcription at the end of the crtB gena The pres- 

45 ence of a termination signal at this location would nrtake sense, since in the original operon organisation in the Flavo- 
bacterium $p. R1534 genome, the crtE and the crtB genes are facing each other. With this constellation a transcription 
termination signal at tiie 5* end of crtB would make sense, in order to avoid the synthesis of anti-sense RNA which could 
interfere witii the mRNA transcript of the crtE gene. Since this region has been changed oonsiderabfy with respect to 
the wiki type situation, the sequences constituting this terminator may also have been altered resulting in a leaky" ter- 

50 nrdnator. Western blot analysis using antisera against the different crt enzymes of the carotenoid pathway, pointed 
towards the possibility that the nlxtsomal binding sites might be responsible for the lack of carotenoid syrrthesis. Out of 
the five genes introduced only the product of crtZ. the p-carotene hydroxylase was detectable. TTiis is the only gene pre- 
ceded by a RBS site, originating from tiie pXI12 vector, known to be functional in B. subtilts. Base pairing interactions 
between a mRNA's Shtne-Dalgamo sequence [Shine and Delagamo, s. a.] arxi the 16S rRNA, which permits the ribe- 

55 some to select the proper initiation site, have been proposed by [McLaughlin et al.. J. Biol. Chem. 25S,1 1283-1 1291 
(1981)] to be much more statrfe in Granri-positive organisms (0. subtilts) than in Gram-negative organisms (£ coli"). In 
order to obtain highly stable complexes we exchanged the RBS sites of the Gram-negative Flavobacterium sp., preced- 
ing each of the genes crtY, crti, crtB and crtE, witii synthetic RBS's which were designed complementary to the 3' end 
of the a subtilis 16S rRNA (see table 2). This exchange should allow an effective translation initiation of tiie different 
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carotenoid genes in E subtilis. The strategy chosen to construct this pXt12-ZYIB-EiNV4MUTRBS2C. corrtaining all four 
altered sites is summarized in figure 20. In order to facilitate the further cloning steps in pBluescriptllKS(-^). additional 
restriction sites were introduced using the linker obtained with primer MUT7 and MUT8. cl ned between the Sail and 
HIndlll sites of said vector. The new resulting construct pBIIKS(+)-LINKER78 had the following restriction sites intro- 

5 duced: Avrll, Pmll. Mull. Muni, BamHI and Sphl. The general approach chosen to create the synthetic RBS's upstream 
of the different carotenoid genes, was done using a combination of PGR based mutagenesis, where the genes were 
reconstructed using defined primers candying the modified RBS sites, or using synthetic linkers having such sequences. 
Reconstitution of the RBS preceding the crti and crtB genes was done by amplifying the crti gene with the primers 
MLrr2 and MUT6, which include tiie appropriate altered RBS sites. The PCR*I fragment obtained was digested with 

70 Muni and BamHI and ligated into the Muni and BamHI sites of pBIIKS(+)-LINKER78. The resulting Intermediate con- 
struct was named pBIIKS(+)-LINKER78PCRI. Reconstitution of the RBS preceding the crtB gene was done using a 
small PGR fragmerrt obtained with primer MUT3, carrying the altered RBS site upstream of crtB, and primer GAR17. 
The amplified PGR-F fragment was digested with BamHI and Hindllt and sub cloned into the BamHI and Hindlll sites 
of pBIIKS{+)-UNKER78. resulting in the consti^uct pBIIKS(+)-UNKER78PGRF. "Rie PCR-I fragment was cut out of pBI- 

is IKS(+)-LINKER78PGRI with BamHI and Sapl and ligated into the BamHI and Sapl sites of pBllKS(-i-)-UNKER78PGRF. 
The resulting plasmid pBIIKS(+)-LINKER78PCRFI has the PCR-I fragment fused to the PCR-F fragment. This con- 
struct was cut with Sail and Pmll and a synthetic linker obtained by annealing of primer MUT9 and MUT10 was intro- 
duced. This latter step was done to facilitate the upcoming replacement of the original Fiavobacterium RBS in the above 
mentioned construct. The resulting plasmid was named pBIIKS(+)-LINKER78PGRFIA. Assembling of the synthetic 

20 RBS's preceding the crtY and crti genes was done by PGR. using primers MUT1 and MUT5. The amplified fragment 
PGR-G was made blunt end before doning into the Smal site of pUG1 8. resulting in construct pUG18-PGR-Q. The next 
step was the cloning of the PGR-G fragment between tiie PCR-A and PGR-I fragments. For tiiis purpose the PGR-G 
was isolated from pUG18-PGR-G by digesting with Muni and Pmll and ligated into the MunI and Pmll sites of pBllKS(-f)- 
LINKER78PGRFIA. This construct contains all four fragments, PGR-F, PGR-I, PGR-G and PGR-A. assembled adjacent 

25 to each other and containing three of the four artificial RBS sites (crtY. crti arxi crtB). The exchange of the 
Ftavobacterium RBS's preceding the genes crtY. crti and crtB by synthetic ones, was done by replacing tiie Hindlll-Sall 
fragment of plasmid pXI12-ZYIB-EINV4 with the Hindlll-Sall fragment of plasmid pBllKS(+)-LINKER78PGRFIGA. The 
resulting plasmid pXI12-ZYIB-EINV4 MUTRBSG was subsequentiy transformed into E, coli TG1 cells and S. subtilis 
1012. The production of zeaxanthin by these cells confirmed that the PGR amplified genes where functional. The B. 

30 subtilis strain obtained was named BS1012::SFGO1 . The last Fiavobacterium RBS to be exchanged was the one pre- 
ceding Vne crtE gena This was done using a linker obtained using primer MUT1 1 and MUT12. The wild type RBS was 
removed from pXI12-ZYIB-EINV4MLrTRBS with Ndel and Spel and the above mentioned linker was inserted. In the 
construct pXI12-ZYIB-EINV4MUTRBS2G all Fiavobacterium RBS's have been replaced by synthetic RBS's of the con- 
sensus sequence AAAGGAGG- 7-8 N -ATG (see table 2). E, coli TGI cells transformed with this construct showed that 

35 also this last RBS replacement had not interferred 



Tariff 2 

mJLNA 
crtZ 
crtY 

45 

crti 
crtB 

so 

CTtE 



nttclcQtidcscqwcnce 

AAAGGAGGGUUUC AUAIIfiAG C 
AAAGGA6GACACGUGAUGAGG 
AAAGGAGGCAAUUGAGajZSAGU 

aaaggaggau co a au c aiigac c 
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B. subtilis 
E. coli 



16S rRNA 3'-aCUUUCCUCCACDAG 
16S rRNA 3 - AUUCCUCCACUAG 



Table 2: 



Nudeotide sequences of the synthetic ribosome binding 
sites in the constructs pXI12-ZYIB-EINV4MUTRBS2C, 
pXI12-ZYIB-EINV4MUTRBS2CCAT and pXI12-ZYIB- 
EINV4 MUTRBS2CNrEO. Nucleotides of the Shine- 
Dalgamo sequence preceding the individual carotenoid 
genes which are complementary to the 3* ends of the 16S 
rRNA of B. subtilis are shown in bold. The 3* ends of the 16S 
rRNA of E. coli is also shown as comparison. The 
underlined AUG is the translation start site of the 
mentioned gene. 



with the ability to produce zeaxanthin. All the regions containing the newly introduced synthetic RBS's were confirmed 
by sequencing. B. subtilis cells were transformed with plasmid pXI12-ZYIB-ElNV4MUTRBS2 and one transfornmnt 
having integrated the SFCO by redprocal recombination, into the levan-sucrase gene of the chromosome, was 
selected. This strain was named BS1012::SFCO2. Analysis of the carotenoid production of this strain show that the 
amounts zeaxanthin produced is approx. 40% of the zeaxanthin produced by E. coli cells transformed with the plasmtd 
used to get the E subtilis transformant Similar was the observation when conrparing the BS1012::SFCO1 strain with 
its E. coli counter part (30%). Although the E. coli cells have 1 8 times more carotenoid genes* the carotenoid production 
is only a factor of 2-3 times higher More drastic was the difference observed in the carotenoid contents, between E. coli 
cells carrying the pZea4 construct in about 200 copies and the E. coli carrying the plasmid pXI12-ZYIB- 
EINV4MUTRBS2C in 18 copies. The first transformant produced 48x more zeaxanthin than the latter one. This differ- 
ence seen can not only be attn'buted to the roughly 1 1 times nrtore carotenoid biosynthesis genes present in these trans- 
formants. Contributing to this difference is probably also the suboptimal performance of the newly constructed SFCO, 
in which the overlapping genes of the wild type Flavobacterium operon were separated to introduce the synthetic 
RBS's. This could have resulted in a lower translation efficiency of the rebuild synthetic operon (ag. due to elimination 
of putative translational coupling effects, preserrt in tiie wild type operon). 

In order to increase the carotenoid production, two new constructs were made, pXl12-ZYIB- 
E1NV4MUTRBS2CNEO and pXI12-ZYIB-EINV4 MUTRBS2GCAT. which after the integration of the SFCO into tiie 
levan-sucrase site of tine chromosome, generate strains with an ampltfiable stixicture as described by [Janniere et al.. 
Gene 4a 47-55 (1985)]. Plasmid pXM2-ZYIB-EINV4MUTRBS2CNEO has been deposited on May 25, 1995 at the 
DSM-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (Germany) under accession No. DSM 10013. 
Such ampltfiable structures, when linked to a resistance marker (e.g chloramphenicol, neomycin, tetracycline), can be 
amplified to 20-50 copies per chromosome. The amplifiable structure consist of the SFCO. the resistance gene and the 
pXI12 sequence, flanked by direct repeats of the sac-B 3* gene (see figure 22). New strains having elevated numbers 
of the SFCO could now be obtained by selecting for transformants with increased level of resistarx:e to the antibiotic. 
To construct plasmtd pXI12-ZYlB-EINV4MUTRBS2CNEO. the neomycin resistance gene was isolated from plasmid 
pBESTSOl witii PstI and Smal and subcloned into the PstI and EcoO1091 sites of the pUCIS vector. The resulting con- 
struct was named pUC18-Neo. To get ttie final construct the Pmel - Aatll fragment of plasmtd pXI12-ZYlB- 
EINV4MUTRBS2C was replaced with the Smal-Aatll fragment of pUC18-Nea containing the neomycin resistance 
gene. Plasmid pXI12*ZYIB-EINV4MUTRBS2CCAT was obtained as follows: the chloramphenicol resistance gene of 
pCI 94 was isolated by PCR using the primer pair cat3 and cat4. The fragment was digested with EcoRI and Aatll and 
subcloned irrto the EcoRI and Aatll sites of pUCi 8. The resulting plasmid was named pUC18-CAT. The final vector was 
obtained by repladng the Pmel-Aatll fragment of pXI12-2r)riB-EINV4MUTRBS2C with the EcoRI-Aatll fragment of 
pUC18-CAT, carrying the chloramphenicol resistance gene. Figure 23 summarizes the different steps to ok)tain afore- 
mentioned constructs. Both plasmids were transformed into B. subtilis strain 1012, and transformants resulting from a 



18 



EP 0 747 483 A2 



Campbell-type integration were selected. Two strains BS1012::SFCONEOl and BS1012::SFCOCAT1 were chosen for 
further amplification. Individual colonies of both strains were independently amplified by growing them in different con- 
centrations of antibiotics as described in the methods section. For the cat gene carrying strain, the chloranriphenicol 
concentrations were 60, 80, 120 and 150 M-g/ml. For the neo gene carrying strain, the neomycin concentrations were 

5 160 and 180 fig/hil. In both strains only strains with minor amplifications of the SFCO's were obtained. In daughter 
strains generated from strain BS1012::SFCONEOl. the resistance to higher neomycin concentrations conrelated with 
the increase in the number of SFCO's In the chromosome and witii higher levels of carotenoids produced by tiiese cells. 
A different result was obtained with daughter strains obtained from strain BS1012::SFCOCAT1. In these strains an 
increase up to 150 \lq chloramphenicol/ml resulted, as expected, in a higher number of SFCO copies in the chromo- 

io some. 

Example 7 

Constmction of CrtW contpininq pipsmltfs pnd iise for cgrotgnolti production 

IS 

Polymerase chain reaction based gene synthesis. The nucleotide sequence of the artificial crtW gene, encoding 
the ^carotene p-4-oxygenase of A/patigenes strain POI, was obtained by back translating the amino acid sequence 
outiined in [Misawa, 1995], using the BackTranslate program of the GCG Wisconsin Sequence Analysis Package, Ver- 
sion 8.0 (Genetics Computer Group, Madison, Wl, USA) and a codon frequency reference table of cofi (supplied by 

20 the Bach Translate Program). The synthetic gene consisting of 726 nucleotides was constructed basically according to 
the method described by [Ye. 1992]. The sequence of the 12 oligonucleotides (crtWI - crtW12) required for the synthe- 
sis are shown in Figure 25. Briefly, the long oligonucleotides were designed to have short overlaps of 1 5-20 bases, serv- 
ing as primers for the extension of the oligonucleotides. After four cycles a few copies of tiie full length gene should be 
present which is then amplified by the two terminal oligonucleotides crtW15 and crtW26. The sequences for these two 

ss short oligonucleotides are for the forward primer crtWIS (5'-TATATCTAGAca tataTCCGGTCGTAAA CCGG- S*) and for 
the reverse primer crtW26 (5'-TATAgaattccacgtgTCA AGCACGACCACCGGTTTTACG -ai. where the sequences 
matcfiing the DNA templates are underiined. Small cap letters show the introduced restriction sites (A/del for the for- 
ward primer and ScoRI and Pml\ for tiie reverse primer) for the latter cloning into the pALTER-Ex2 expression vector. 
Polymerase chain reaction. All twelve long oligonucleotides (crtWI -crtW12; 7 nM each) and both terminal primers 

30 (crtW15 and crtW26: 0.1 mM each) were mixed and added to a PCR reaction mix containing Expand^ High Fidelity 
polymerase (Boehringer, Mannheim) (3.5 units) and dNTP's (100 mM each). The PCR reaction was run for 30 cycles 
witii the following profile: 94 ^C for 1 min. 50 ^C for 2 min and 72 ^C for 3 min. The PCR reaction was separated on a 
1% agarose gel, and the band of approx. 700 bp was exdsed and purified using the glass beads method (Geneclean 
Kit, Biol 01 . Vista, CA. USA). The fragment was subsequentely cloned into the Sma\ site of plasmid pUC18. using the 

35 Sure-Clone Kit (Pharmacia, Uppsala, Sweden). The sequence of tiie resulting crtW synthetic gene was verified by 
sequencing with the Sequenase Kit Version 1 .0 (United States Biochemical, Cleveland, OH, USA). The crtW gene con- 
structed by this method was found to contain minor errors, which were subsequentiy corrected by site-directed muta- 
genesis. 

Construction ofplasmids. Plasmid pBIIKS(+)-CARVEG-E (see also Exarrple 5) (Figure 26) contains the carotenoid 

40 biosyntiiesis genes (crtE, crtB. crtY crti and crtZ) of the Gram (-) bacterium Flavotmcterium sp. strain R1534 WT 
(ATCC 21588) [Pasamontes. 1995 #732] cloned into a modified pBluescript II KS(-i-) vector (Stratagene, La Jolla, USA) 
carrying site I of the 6. subtHis veg promoter [LeGrice. 1986 #806]. This constitutive promoter has been shown to be 
functional in E. coli. Transfbrmants of E. coli strain TGI carrying plasmid pBIIKS(+)-CARVEG-E synthesise zeaxanthin. 
Plasmid pALTER-Ex2-crtW was constructed by cloning the Nde\ - EcoRI restricted fragment of the syntiietic crtW gene 

4s into the corresponding sites of plasmid pALTER-Ex2 (Promega. Madison. WQ. Plasmid pALTER-Ex2 is a low copy plas- 
mid witii the pi 5a origin of replication, which allows it to be maintained with ColEI vectors in the same host. Plasmid 
pBIIKS-crtEBIYZW (Figure 26) was obtained by cloning the Hind\\\'Pml\ fragment of pALTER-Ex2-crtW into tiie Hind\\\ 
and the blurrt end made Mlu\ site obtained by a fill in reaction with Klenow enzyme, as described elsewhere in [Sam- 
brook. 1989 #505]. Inactivation of the crtZ gene was done by deleting a 285 bp Nsi\'Nsi\ fragment followed by a fill in 

50 reaction and religation, resulting in plasmid pBIIKS-crtEBIY[AZ]W. Plasmid pBIIKS-crtEBIY[AZW] carrying the non- 
functional genes crtW and crtZ, was constructed by digesting the plasmid pBIIKS-crtEBIY[AZ]W with Nde\ and Hpal, 
and subsequent self religation of the plasmid after filling in the sites with Klenow enzyme. £. co// transformed with this 
plasmid had a yellow-orange colour due to the accumulation of p-carotene. Plasmid pBIIKS-crtEBIYZ[AW] has a trun- 
cated CrtW gene obtained by deleting the Nde\ - Hpal fragment in plasmid pBIIKS-crtEBIYZW as outlined above. Plas- 

55 mids pALTER-Ex2-crtEBIY[A2Wl and pALTER-Ex2-crtEBIY2[AW], were obtained by isolating the BamH\'Xba\ 
fragment from pBIIKS-crtEBIY[AZWl and pBIIKS-CTtEBIYZ[AW], respectively and cloning them into the BamHI and 
Xba\ sites of pALTER-Ex2. The plasmid pBllKS-crtW was constructed by digesting pBllKS-crtEBIYZW with Nsi\ and 
Sad, and self-religating the plasmid after recessing the DNA overhangs with Klenow enzyme. Figure 27 connpiies the 
relevant inserts of all tiie plasmids used in this paper. 
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Carotenoid analysis. E. colt TG-1 transformants carrying the drfferent plasmkJ constructs were grown for 20 hours 
in Luria-Broth medium supplemented with antibiotics (ampidllin 100 \iQ/rY\l tetracyclin 12.5 ftgAml) in shake flasks at 
37^0 and 220 rpm. Carotenoids were extracted from the ceils with acetone. Th acetone was removed in vacuo and 
the residue was re dissolved in toluene. The col ured solutions were subjected to high-performance liquid chromatog- 

5 raphy (HPLC) analysis which was performed on a Hewlett-Packard series 1 050 instrumem. The carotenoids were sep- 
arated on a silica column Nudeosil Si - 100. 200 x 4 mm. 3m. The solvent system included two solvents: hexane (A) 
and hexane/THF , 1 :1 (B). A linear gradient was applied running from 13 to 50 % (B) within 15 minutes. The flow rate 
was 1 .5 ml/nnin. Peaks were detected at 450 nm by a photo diode array detector. The individual carotenoid pigments 
were Identified by their absorption spectra and typical retention times as compared to reference samples of chemically 

w pure carotenoids, prepared by chemical synthesis and characterised by NMR. MS arxJ UV-Spectra. HPLC analysis of 
the pigments isolated from £. coli cells transformed with plasmid pBIIKS-crtEBIYZW, carrying besides the carotenoid 
biosynthesis genes of Flavobacterium sp. strain R1534, also the crtW gene encoding the p-carotene ketolase of 
Alcaligenes PC-1 [Misawa, 1995 #670] gave the following major peaks identified as: p-cryptoxanthin. astaxanthin, ado- 
nixanthin and zeaxanthin. based on the retention times and on the comparison of the absorbance spectra to given ref- 

is er&ice samples of chemically pure carotenoids. The relative anx)unt (area percent) of the accumulated pigment in the 
E. coli transformarrt carrying pBIIKS-crtEBIYZW is shown in TabUe 3 fCRX": cryptoxanthin; "ASX*: astaxanthin; "ADX": 
adonixanthin; "ZXN": zeaxanthin; "ECM": echinenone; "MECH": 3-hydroxyechinenone, "CXrsT: cantaxanthin]. The Z of 
the peak areas of alt identified carotenoids was defined as 100%. Number^ shown in Table 3 r^resent the average 
value of four independent cultures for each transformant. In contrast to the aforementioned results. E. cofi transform- 

20 ants carrying the same genes but on two plasmids namely, pBIIKS-crtEBIYZ[AW] and pALTER-Ex2-crtW. showed a 
drastical drop in adonixanthin and a complete lack of astaxanthin pigments (Table 3). whereas the relative amount of 
zeaxanthin ( %) had increased. Echinenone. hydroxyechinenone and canthaxanthin levels remained unchanged com- 
pared to the transforn^ants carrying all the crt genes on the same plasmid (pBIIKS-crtEBIYZAW). Plasmid pBIIKS-crtE- 
BIYZ[AW] is a high copy plasmid carrying the functional genes of crtE. crtB, crtY. crtl, crtZ of Ravobacterium sp. strain 

2s R1534 and a truncated, norvfunctional version of the crtW gene, whereas the functional copy of tiie crtW gene fe 
located on the low copy plasmid pALTER-Ex2-crfW. To analyze the effect of overexpression of the crtW gene with 
respect to the crtZ gene. £, coli cells were co-transformed witii plasmid pBIIKS-crtW carrying the crtW gene on the high 
copy plasmid pBIIKS-crtW and the low copy corrstruct pALTER-Ex2-crtEBIYZ[AW], encoding the Flavobacterium crt 
genes. Pigment analysis of these transformants by HPLC monitored the presence of p-carotene, cryptoxanthin, astax- 

30 anthin. adonixanthin. zeaxanthin. 3-hydroxyechine-none and minute traces of echinenone and canthaxanthin (Table 3). 
Transformants harbouring the crtW gene on the low copy plasmid pALTER-Ex2-crtW and the genes crtE. crtB, crtY and 
crti on the high copy plasmid pBllKS-crtEBIY[AZW] expressed only minor amounts of canthaxairttiin (5 %) t>ut high lev- 
els of echinenone {94%), whereas cells canrying the crtW gene on the high copy plasmid pBIIKS-crtW and the other crt 
genes on the low copy construct pALTER-Ex2-crtEBIY[AZW], had 78.6 % and 21.4 % of echinenone and canthaxan- 

35 thin, respectively (Table 3). 



Tabled 



plasmids 


CRX 


ASX 


ADX 


ZXN 


ECH 


HECH 


CXN 


pBllKS-crtEBIYZW 


1.1 


2.0 


44.2 


52.4 


<1 


<1 


<1 


pBIIKS-crtEBIYZ[AW] + pALTER-Ex2-crtW 


2.2 




25.4 


72.4 


<1 


<1 


<1 


pBllKS-crtEBIY[AZ]W 










66.5 




33.5 


pBllKS-crtEBIY[AZW] + pBIIKS-crtW 










94 
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50 Claims 

1 . A ONA sequence comprising one or rrrore DNA sequences selected from the group consisting of: 

a) a DNA sequence which encodes the GGPP synthase of Ravobacterium sp. R1534 (crtE) or a DNA 
55 sequence which is substantially homologous; 

b) a DNA sequence which encodes the prephytoene synthase of Ravobacterium sp. R1534 (crtB) or a DNA 
sequence which is substantially homologous; 
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c) a DNA sequence which enccxJes the phytoene desaturase of Flavobacterium sp. R1534 (crtl) or a DNA 
sequence which is substantially homologous; 

d) a DNA sequence which encodes the lycopene cyclase of Flavobacteriunn sp. R1534 (crtY) or a DNA 
6 sequence which is substantially homologous: i 

e) a DNA sequence which encodes the p-carotene hydroxylase of Flavobacterium sp. R1534 (crtZ) or a DNA 
sequence which is substantially homologous. 

10 2. A DNA sequence as claimed in claim 1 cornprising the following DNA sequences: 

a) a DNA sequence which encodes the GGPP synthase of Flavobacterium sp. R1534 (crtE) or a DNA 
sequence which is substantially homologous, and 

IS b) a DNA sequence which encodes the prephytoene synthase of Ravobacterium sp. R1534 (crtB) or a DNA 

sequence which is substantially homologous, and 

c) a DNA sequence which encodes the phytoene desaturase of Ravobacterium sp. R1534 (crtl) or a DNA 
sequence which is substantially homologous. 

20 

3. A DNA sequence as claimed in claim 1 comprising the following DNA sequences: 

a) a DNA sequence which encodes the GGPP synthase of Ravobacterium sp. R1534 (crtE) or a DNA 
sequence which is substantially homologous, and 

25 

b) a DNA sequence which encodes the prephytoene synthase of Ravobacterium sp. R1534 (crtB) or a DNA 
sequence which is substantially homologous, and 

c) a DNA sequence which encodes the phytoene desaturase of Ravobacterium sp. R1534 (crtl) or a DNA 
30 sequence which is substantially honrtologous. and 

d) a DNA sequence which encodes the lyoopene cyclase of Flavobacterium sp. R1534 (crtY) or a DNA 
sequence which is substantially honnologous. 

35 4. A DNA sequence as claimed in claim 1 comprising the following DNA sequences: 

a) a DNA sequence which encodes the GGPP synthase of Ravobacterium sp. R1534 (crtE) or a DNA 
sequence which is substantially homologous, and 

40 b) a DNA sequence which encodes the prephytoene synthase of Flavobacterium sp. R1534 (crtB) or a DNA 

sequence which is substantially homologous, and 

c) a DNA sequence which encodes the phytoene desaturase of Ravobacterium sp. R1534 (crtl) or a DNA 
sequence which is substantially homologous, arKi 

45 

d) a DNA sequence wNch encodes the lycopene cyclase of Ravobacterium sp. R1534 (atY) or a DNA 
sequence which is substantially homologous, and 

e) a DNA sequence which encodes the p-carotene hydroxylase of Flavobacterium sp. R1534 (crtZ) or a DNA 
so sequence which is substantially homologous. 

5. A DNA sequence as claimed in claim 4 which comprises in addition to the DNA sequences specHied in claim 4 a 
DNA sequence which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) or a DNA sequence 
which is substantially homologous. 

55 

6. A DNA sequence as claimed in claim 3 which comprises in addition to the DNA sequences specified in claim 3 a 
DNA sequence which encodes the p-carotene p4-Qxygenase of Alcaligenes strain PC-1 (crt W) or a DNA sequence 
which is substantially homologous. 
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7. A vector comprising the DMA sequence of claim 1 . 

8. A vector comprising the DNA sequence of claim 2. 
5 9. A vector comprising the DNA sequence of claim 3. 

10. A vector comprising the DNA sequence of claim 4. 

11 . A vector conr^rising the DNA sequence of claim 5. 

10 

12. A vector comprising the DNA sequence of daim 6 

1 3. A cell which is transformed by the DNA sequence of claim 1 or the vector of daim 7. 
75 14. A cell which is transformed by the DNA sequence of daim 2 or the vector of daim 8. 

1 5. A cell which is transformed by the DNA sequence of daim 3 or the vector of claim 9. 

1 6. A cell which is transformed by the DNA sequence of claim 4 or the vector of daim 1 0. 

so 

1 7. A cell which is transformed by the DNA sequence of daim 5 or the vector of daim 1 1 . 

18. A cell which is transformed by the DNA sequence of claim 6 or the vector of daim 1 2. 

25 1 9. A cell which is transformed by the DNA sequence of daim 4 or the vector of claim 1 0 and a second DNA sequence 
which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is sub- 
stantially homdogous or a second vector which comprises a DNA sequence which encodes the ^-carotene p4-oxy- 
genase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is substantially homologous. 

30 20. A cell which is transformed by the DNA sequence of claim 3 or the vector of claim 9 and a second DNA sequence 
which encodes the p-carotene p4-oxygenase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is sub- 
stantially homdogous or a second vector which comprises a DNA sequence which encodes the p-carotene p4-oxy- 
genase of Alcaligenes strain PC-1 (crt W) or a DNA sequence which is substantially horrralogous. 

35 21. The cell of any one of daims 13 to 20 which is a prokaryotic cell. 

22. The cell of claim 21 which is E. coli. 

23. The cell of daim 21 which is a Bacillus strain. 

40 

24. The cell of any one of daims 13 to 20 which is an eukaryotic cell. 

25. The cell of daim 24 which is a yeast cell. 
45 26. The cell of claim 24 which is a fungal cell. 

27. A process for the preparation of a desired carot&ioid or a mixture of carotenoids by cufturing a cell as claimed in 
any one of daims 13 to 26 under suitable culture conditions and isolating the desired carotenoid or a mixture of car- 
otenoids from such cells or the culture medium arKl. in case only one carotenoid is desired separating it by methods 

50 known in the art from other carotenoids which nrdght also be present. 

28. A process as claimed in claim 27 for the preparation of lycopene by culturing a cell as dalmed in claim 14. 

29. A process as claimed in daim 27 for the preparation of p-carotene by culturing a cell as claimed in claim 15. 

55 

30. A process as claimed in daim 27 for the preparation of echinenone by culturing cells as claimed in claim 18 or 20. 

31. A process as claimed In daim 27 for the preparation of canthaxanthin by culturing cells as claimed in claim 18. 
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32. A process as claimed in daim 27 for the preparation of zeaxanthin by cutturing cells as claimed in claim 17 or 19. 

33. A process as claimed in daim 27 for the preparation f ad nixanthin by culturing cells as claimed in claim 1 7 or 1 9. 

34. A process as claimed in daim 27 for the preparation of astaxanthin by culturing cells as claimed in claim 17. 

35. A process for the preparation of a food or feed composition characterized therein that after a process as claimed in 
any one of claims 27 to 34 has been effected the carotenoid or carotenoid mixture is added to food or feed. 
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1 MTPKQQFPLR DLVEIRLAQI SGQFGWSAP LGAAMSDAAL SPGKRFRAVL 

51 MLMVAESSGG VCDAMVDAAC AVEMVHAASL IFDDMPCMDD ARTRRGQPAT 

101 HVAHGEGRAV LAGIALITEA MRILGEARGA TPDQRARLVA SMSRAMGPVG 

151 LCAGQDLDLH APKDAAGIER EQDLKTGVXF VAGLEMLSII KGLDKAETEQ 

201 LMAFGRQLGR VFQSYDDLLD VIGDKASTGK DTARDTAAPG PKGGLMAVGQ 

251 MGDVAQHYRA SRAQLDELMR TRLFRGGQIA DLLARVLPHD XRRSA 
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1 MTDLTATSEA AIAQGSQSFA QAAKLMPPGI REDTVMLYAW CRHADDVIDO 

51 QVMGSAPEAG GDPQARLGAL RADTLAALHE DGPMSPPFAA LRQVARRHDF 

101 PDLWPMDLIE GFAMDVADRE YRSLDDVLEY SYHVAGWGV MMARVMGVQD 

151 DAVLDRACDL GLAFQLTNIA RDVIDDAAIG RCYLPADWLA EAGATVEGPV 

201 PSDALYSVII RLLDAAEPYY ASARQGLPHL PPRCAWSIAA ALRIYRAIGT 

251 RIRQGGPEAY RQRISTSKAA KIGLLARGGL DAAASRLRGG EISRDGLWTR 

301 PRA 
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Fig- 10 

1 MSSAIVIGAG FGGLALAIRL QSAGIATTIV EARDKPGGRA YVWNDQGHVF 

51 DAGPTWTDP DSLRELWALS GQPMERDVTL LPVSPFYRLT WADGRSFEYV 

101 NDDDELIRQV ASFNPADVDG YRRFHDYAEE VYREGYLKLG TTPFLKLGQM 

151 LNAAPALMRL QAYRSVHSMV ARFIQDPHLR QAFSFHTLLV GGNPFSTSSX 

201 YALIHALERR GGVWFAKGGT NQLVAGMVAL FERLGGTLLL NARVTRIDTE 

251 GDRATGVTLL DGRQLRADTV ASNGDVMHSY RDLLGHTRRG RTKAAILNRQ 

301 RWSMSLFVLH FGLSKRPENL AHHSVIFGPR YKGLVNEIFN GPRLPDDFSM 

351 YLHSPCVTDP SLAPEGMSTH YVLAPVPHLG RADVDWEAEA PGYAERIFEE 

4 01 LERRAIPDLR KHLTVSRIFS PADFSTELSA HHGSAFSVEP ILTQSAWFRP 

4 51 HNRDRAIPNF YIVGAGTHPG AGIPGWGSA KATAQVMLSD LAVA 
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1 MSHDLLIAGA GLSGALIALA VRDRRPDARI VMLDARSGPS DQHTWSCHDT 

51 DLSPEWLARL SPIRRGEWTD QEVAFPDHSR RLTTGYGSIE AGALXGLLQG 

101 VDLRWNTHVA TLDDTGATLT DGSRIEAACV IDARGAVETP HLTVGFQKFV 

151 GVEIETDAPH GVERPMIMDA TVPQMDGYRF lYLLPFSPTR ILIEDTRYSD 

201 GGDLDDGALA QASLDYAARR GWTGQEMRRE RGILPIALAH DAIGFWRDHA 

251 QGAVPVGLGA GLFHPVTGYS LPYAAQVADA lAARDLTTAS ARRAVRGWAI 

301 DRADRDRFLR LLNRMLFRGC PPDRRYRLZQ RFYRLPQPLI ERFYAGRLTL 

351 ADRLRIVTGR PPIPLSQAVR CLPERPLLQE RA 
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1 MSTWAAILTV ILTVAAMELT AYSVHRWIMH GPLGWGWHKS HHDEDHDHAL 

51 EKNDLYGVIF AVISIVLFAI GAMGSDLAWW LAVGVTCYGL lYYFLHDGLV 

101 HGRWPFRYVP KRGYLRRVYQ AHRMHHAVHG RENCVSFGFI WAPSVDSLKA 

151 ELKRSGALLK DREGADRNT 
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Fig. 14 

I ■ ^ CrtE 

#100: 5 * tataracraar ^agaggacaaal cracarAIGACGCCCAAGCAGCAGCAArrC 3* 



#101: 5 • TATATACmafiGTCAGCCGCGACGGCCTGTGG 3 * 



^ ^ I » CrtZ 

#104: 5 * tatatgaa£r£ ^wq<yaqaaal ccaear ATGACgArTTKKKrrKraATrr 3* 
EcoRl , RBS Ndel 



#103: 5 • GTTTCAGCTCTGCCTTGAGGC 3 ' 



CrtZ — *j ertV 

MUTl: 5 • GCGAAGGGGCGGATCGCAATAC gIia kaaaqaggat:glgeTr gAT SAGCCATGATCTCgTCATrR 3 • 



crtY— •^j p*- Crtl 

MUTi 5 • GCCCCCTGCTGCAGGAGAGAGC tl Q^aaggaoty ^ a; f g f g aoATSAGTTCCGCgATeCTr attc 3 • 

Muni * 



crtl — I » CrtB 

MUn: 5 ' GGTCATGCTGTCGGACCTGGCCGTCGCti gkaaggag jiZ3 GaatcATGACCGATCTGACGGCGACTTCC 3 ' 

B«inHl 



MUT5: 5 ' ATATATcr r a a ff r g cc r eer rtjr » afirrrrrrr r-pftr ^ f^r Tktinti 3 • 



Mirr6:5* atgattaaatCCCCCttrnaaGCGACGGCCAGGTCCGACAGC 3* 
hmxaHl 1^ crtl 



CAIU7 5 • CAGAACCCATCACCTGCCCGTC 3 ' 



dd- 5 ' CGCfiMUCTCGCCGGCAATAGTTACC 3 ' 
EcoRl 



caw: 5 • GTCACATGCATiSCArfirTACIaAiSClCATAAGCATGTfiACer^ 3 ' 

SphI S«cl ajuQ 
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Fig, 2Q/1 
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Fig, ZQI2 
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Fiflr 21/2 




- 0.24 kb- 
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Fig. 24/1 



CTAAATTGTAAGCGTTAATA rr TT G ITAAAATTCGCGTTAAAT'ri' rrGX IAAATCACCTC 

1 + + f 1- 1- 60 

GATTTAACATTCGCAATTATAAAACAATTTTAAOICCAATTTAAAAACAATTrACT^ 

ATTTTTTAACCAATAGCCCGAAATCGGCZAAAATCCCrTATAAATCAAAAGAATAGACCGA 

fil f ^ + ► 1 ► 120 

TAAAAAATTGGTTATCCCCCrrTAGCC GTT T T AGGgAATATTTA Gri IT CI l ATCTCQCT 

GATACGCTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAACAACCTCGACrC 

121 — ^ f + 1 1- 180 

CTATCCCAACTCACAACAAGGTCAAACCTTGTTCTCAGCTCAXAATTTCTTCCACCTSAC 

CAACGTCAAAGGGCC»AAAACCCTCTATCAGGCCGATGGCCCACTAC<7rGAAC^ 

181 1 f 1 H i—- »• 240 

GTTGCACTTTCCCG C T T T T rG GCAGATAgTCCCGCXACCGGGTGATGCA C rr G G TA CTGG 

CTAATCAAC I T r TTTGCqyTCGAGGTCCCGTAAAGCACTAAATCCGAACCCTAAAGCGAC 

241 1 + 1 1. f ■ ■ ■« 300 

GATTAGTTCAAAAAACCCCAGCTCCACCGCATTTCCTGATTTXGCCTTCCGATTTCCCrC 

CCCCCGATTTAGAGCTTGACGGGGAAAGCCGC»:CAACCrrGGCGAGAAAGGAACGGAAGAA 

301 + + : 360 

GCGGGCTAAATCTCGAACTGCCCCTTTCGGCCCCrrCCACCC ClUi ' I iCCi I ' UCCl iCl 1 

3fil tff?^— + +-I — ffl— H fI2L.ff2^If^ , 420 

TCGCTTTCCTCGCCCGCGATCCCGCGACCCrrCACATCGCCAGTGCGACG(MCArrG<^ 

CACACCCGCCCCGCTTAATGCGCCGCTACAGGCCGCCTCCCATTCGCCATTCAGGCT 

4 21 + +- ^ 1 ' ► 480 

GTGTGGGCGGCGCGAATTACGCGGCGATGTCCCGCGCACGGTAAGCGGTAAGTCCCSACGC 

CAACTGTTGGGAAGGGCCATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGG 

4 81 f H — H 1 »• 340 

GTTGACAACCCTTCCCGCTAGCCACGCCCGGAGAAGCGATAATGCG6TCGACCGCTTTCC 

GGGATCTGCTGCAACGCGATTAAGTTGGGTAACGCCAGC GI X I rC CCACTCACGACCTTG 

541 + A H 1 t + 600 

CCCTACACGACGTTCCCCTAArrCAACCCATTGCGGTCCCAAAACGCTCAGTCCTGCAAC 

TAAAACpACCGCCAGIGAGCGCGCGTAATACGACTCACTATAGGGCGAArrGGASCtCCA 

SOI ; H +- + ^ ■ t t 660 

ATTTTGCTGCCCGTCACTCGCGCGCATTATGCTGAGTCAXATCCCGCTTAACCTCGAGGT 

CCCCGGTGGCGGCCGCrCTACTGGATCCGCGCCTCGCCCTTCGCCATCAGCAGCCGCCCr 

SSI f + * K 1 ► 720 

GGCGCCACCGCCGGCGAGATCACCTAGGCGCGGACCGGCAAGCGCTAGTCGTCGGCGGGA 

TGCGGATCGGTCAGCATCATCCCCATGAACCGCAGCGCACGACGCAGCGCGPGCCCCAGA 

7 21 + * + ■ -t- 780 

ACGCCTAGCCAGTCGTAGTAGGGCTACTTGGCGTCGCGTGCTCCCTCCCGCGCCCGGTCT 

TCGGGCGCGTCCAGCACGGCATGCGCCATCATCGCGAAGCCCCCCGGCCGCATGGGGCCC 

7 81 ^ 1- "< + 84.0 

AGCCCGCGCAGGTCGTGCCCTACGCGGTAGTAGCGCTTCCCGCCGCCCCCCTACCCCGCG 

GTGCCCATTCCGAAGAACTCGCAGCCTCTCCCCTGCCCAAGCTCCCCCCACATCCCGCCG 

841 + + -i- + 1 K 900 

CACGGGTAAGGCTTCTTGACCGTCCGACAGGCGACCCGTTCCACCCCGCTCTACCCCCGC 

TATTCCGATGCACTGACGGGCCCGATGCGCGTGCGCCCGCCCTGCCCCGCCGCCACCACC 

901 * * -I— + 960 

ATAACGCTACGTCACTGCCCGCGCTACGCGCACCCGGGCGGGACGGGGCGCCCGTGGTCC 



65 



EP0747 483 A2 



Fig. 24/2 

GCATCGCGCACCAACCCrrCCGACATGATCTGCTGATCCATGGCCCGTaiTTCCAAAX 

961 + * ► , ► 1020 

OTA^rGCGTCXrrrGGCyu^CCCTCTACTACACCaCTACGTACCCC^ 

GATCACCGATCCTGTCGCCTGATCGCA l XUi ' l I ' C CAATCCCCCGAGCCCTACCATGGCCC 

1021 i h 1. , K 1080 

CTAGTGCCTAGGACAGCGCACTACCGTAACAAACCrTACCGGOCTCCCaiTCCTACCCCC 

gAACGATCAAGCGGGgGAGAGACATGGAAATCGAGCGACC GG T C TT m rC STCACCGCCg 

1081 . K K 1 I 1140 

CrrCCTACTTCCCCCCCTCTCTGTACCTTTAGCTCCCTGCCCACAAACACCAGTGCCCCC 

1141 TOG^J^I^'GCCaXgGCCTCCGCC^ ^^^^ 

GGCGTAGCCCAGA C CC C CCCC 

TGCTGGCC<a.rCTttCCCSSAAOecaxccxrg rTO*t-r^^ >fyqip^^^ 

1201 1- . , « « > 12«0 

ACGACCCXK^AGACCGCCTTO^CTTCCTGCGCGCOCrTCCCXGCC^ 

1261 K -H ►-^ , , H 1320 

G«rrGGACtXKXTTCTCaACTtXa5aKWCATCGCGCCqCCC^ 

1321 1 ' ^ ^^ • K 1380 

CCGXCCTGq:GGAACACrrCAC6CGCCCGTAGCGCGGCCC;GCrT<%CXAa^^ 

AqGGGCCGCATGGACTGGACAGCrrTGCCCGT<X:GCTCACGATCAAOCTGATO^ 

^^^^^CTACCTqZrl x^GAAAC^^^ "^^ ^ ^^^^ 

TCAACATCKgCXCyXTTGCAGCCCacOCa^TCCCCCCGAACGAGCCCGTCCCGG^ 
1441 . 1 , , K ISOO 

ACTTCTACCCGGCMGAACCrCGOCrCCGCTACCCCCCCTTCCTCGCKMACCCCCCCCTC 

(rrGGCGTGATCCTCAACACCCXCTCGATCCCCXXGCAOGACGGACAGAXCGGACAfiCTCC 

1501 * i , , , H 1560 

CACCGCACTACCACTTgTCCCGGAGCTAGCG C CGCGT C CT GC CTGTCTAG CC T G T CC AGC 

CCTATGCCCCCAGCAAC(«GCCGTOXCCGCATGACCCTCCCCarGCCCCCCGACCCT 

1561 — — ■ -I 1 1 » 1620 

QGATACGCaCTCSrrCCGCCCGCACCGCCCCTACTGCGACCCCTXCCGCGCCCTCGAAC 

CCCGGCACGGCATCCCCCTCATGACCArccCGCCCGGCATCTTCCCCACCOCGXTOCTCC 

1621 »" 1 y ■ . .t ■ 1680 

GCGCCGT«XGTAGGCGCACXACTGGTAGCGCCXWCCTA<MCGCGTGC»CCtAa» 

AGGGGCTCCCGCAGCACCrrTCAGGACACCCTGGGCGCGGCOGTGCCCTTra 

ISai K f h i.-^. 1 K 1740 

TCCCCGACGGCGTCCTGCAACTCCTGTCCXSACCCCCCCCCCCACGGGAACGCGAOCGCCC 

TGGGAGAGCCCrrCGGAATACGCGGCCCTGTTGCACCACATCATCCXGAACCCCATGCTGA 

17 41 * ♦ ► K 1 ► 1800 

ACCCTCTCCGCAGCCTTATGCGCCGCGACAACCTGCTGTAGTAGCGCTTCGGGTACGACT 

ACCGACACGTCATCCXCCTCGACGCCGCATTGCGCATGCCCCCCAAGTGAACGAGCCTTT 

1801 + 1 + K 1 1- I860 

TGCCTCTCCACTACCCCCACCTGCCGCCTAACGCCTACCGCGCCTTCACTTCCrCGCAAA 

CATCGACCCCATCGTCATCACCCCCCCGATGCGCACCCCCATCCGGCCATTCCAGCSGCGA 
1861 h H K , , ► 1920 

CTACCTGGGGTAGCACTACTGCCCGCCCTACGCCTCGCCCTACCCCCGTAAGCTCCCGCr 

TCTTCCCCCGATGGATGCCCCCACCCTTCCCGCGGACGCCATCCGCGCCGCGCTGAACCG 
1921 ^ ♦ * ^ 1980 

ACAACCCCCCTACCTACCGGCCTCCGAACCGCGCCTGCCCTAGCCCCGGCCCGACTTCCC 
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Fjir. 24/3 

CCTGTCGCCCGACATGGTGGACGAGGTCCTGATC«CTGCGT<XTCGCCGaKa^ 
1981 ' — + h h -+ 2040 

cgacaccgcgctgtaccacctgctccacgactacccgacgcaggagccgcgccccgtccc 
tcaggcaccggcacgtcaggcckk:gcttgcccccggactgcxgctgtcgac«k^cg^ 

2041 + : + ► K 1 » 2100 

AGTCCCTGGCCGTGCAGTCCGCCGCCyulCCCCGCCCTGACGGCGACACCTGCCCGTGCTO 

CACCATCAACGAGATGTGCGGATCGGGCATGAACGCCCCCaiTGCTGG(XXATCa<^^ 

2101 : K 1- 1- 1. __i K 2X60 

GTGGTAGTTGCTCTACACGCCTAGCCCGTACTTCCGGCGCTACGACCCGGTACTGGACTA 

CGCCGCGGGATCGGCGGGCATCGTCGTCGCCGGCGGGATCGAGAGCATGTCGAACCMrCCC 

21 SI h + K y 1. > 2220 

GCGGCGCCCTAGCCGCCCGTAGCAGCACCGGCCGCCCrACCrCTCGTACAGCrTCX:OGOO 

CTACCTGCTGCCCAAGGCGCGGTCGGCGATOCGCATGGGCCATGACCGTGTGCTGGATCA 

2221 f »- 1 1 h 2280 

CATCGACGACCGCrrCCCCGa:ACCCCCTACCXXgTXCCCCCTACTCgC^ 

CATGTTCCTCCACCGGTTGGACGACCCCTATGXaU^GGGCCGCCTGATOCGa^ 

2281 -I ». - ■ ■ » ► K 2340 

GTACAAGGACCTGCCgAACCTCCTGCGGATA C TGr.I C X :c UXQGJtfrrACCCCTGGAAgCS 

CGAGGATTGCGCCGGCGATCACGGTTTCACCCGCG31GGCGCA 

2341 + + 1. ^4- 1 — + 2400 

QCTCCrAACGCGGCCGCTAGTGCrAAAGTCCGCGCTCCCCGTCCTGCT<3lTrAC^^ 

CAGCCTGGCCCGCGCGCAGCACCCCATCGCCAGC GG r uC Cri'L XiC CGCCGAGATCGCGCC 

2401 1 »• i., — ■ ■ ■- -» — »• 2460 

CTCCGACCGGGCGCGCGTCCTGCGGTAGCGGTCCCCACGGAACCGCCGG C T CTA GCGCGO 

CGTGACaTrCACGCCACGCAACGTGCAGACCACCGTCCaVTACCGACCaGAl^^ 

2461 + 1- 1 K 1 1. 2520 

GCACTGCCAGTGCCGTGCGTTCCACGTCTCGTGGCAGCrrATGGCTCCTCXACGGGCCGTT 

GGCCCCCCCCGAGAAGATCCCCCATCTGAAGCCCGCCTTCCCTGACGGTGGCACGCTCAC 
2521 1. ► K— ► » . . f 2580 



GGCGGCGAACAGCTCGTCGATCTCGGACGCCGCGGCGGCGCTGGTGATGATGCGCCACTC 

2581 + + H ». 1 f 2640 

CCGCCGCTTGTCGAGCACCTAGAGCCTGCCCCGCCGCCGCGACCACTACTACGCGGTCAG 

GCAGGCCGAGAACCTGCGCCTGACGCCGATCGCCCCCATCATCCGTCATCaSACCCATGC 

2 641 + 4. • 1. k 2700 

CGTCaXKTCTTCGACCCGGACTGCCGCTACCGCGCCXAGTACCCAGTACCCTGOGTACC 

CGACCGTCCCGGCCTCTTCCCGACGCCCCCCATCGGCGCCATGCGCAAGCTGCTGGACCG 

2701 + + + K -+ 1. 2760 

GCTGGCAGGGCCCGACAAGGGCTGCCGCGGGTAGCCGCCCTACCCCTTCCACGACCTGCC 

CACCGACACCCGCCTTCCCGATTACGACCTGTTCGAGGTGAACGAGOCATTCCCCCTCCT 

2761 + +— : + h 2820 

CTCCCTGTGGGCCCAACCGCTAATGCTGGACAAGCTCCACTTCCTCCGTAAGCGGCACCA 

CGCCATGATCGCGATGAAGGACCTTGGCCTGCCACACGATCCCACGAACATCAACGGCGC • 

282X + ^ + 1- h 2880 

GCCGTACTAGCGCTACTTCCTCGAACCGGACGGTGTGCTACCCTGCTTGTAGTTGCCGCC 

GCCCTCCCCGCrTCGGCATCCCATCGCCGCCTCCGCGGCCCCGATaVTGGTCACGCTGCT 

2881 - ■ ■ ■ 2940 

CCGGACCCGCGAACCCGTAGCGTAGCCGCGCAGCCCCCGCCCCTAGTACCAGTGCGACGA 

GAACGCCATCGCGCCGCCGCGCGCGACGCCCCGGGCCGCATCCGTCTGCATCGGCGGGGG 

2 941 ♦ + * + + 1. 3000 

CTTGCCCTACCGCCGCGCCCCCCGCTGCGCGCCCCCGCCTAGCCAGACCTAGCCGCCCCC 
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Fig- 2^^^ 

3001 -~ h + + , :; 30^0 

GCTCCCCTCCCCGTAGCGCCACCTTGCCGACTCGATTAACTAAACCCCCTTACGCGCAAA 

TTCGT«»CGATCXK3GGAACCCGAAACGGCCACGCCTGTTCTCGTTCCCntaa^ 
306X + K 1 ^ ^ 3120 

AACXACGTGCTACCCCCTTGGCCTTTGCCXCTCCCGACAACACCAACCCAGCT 

TCGC<»:CATGCCCCTCACGCGATCTCGCAGGCGCATGCGGamGCCGATCCCGTCGCXr 

3121 : «• 1. 1 c 3180 

ACCCCGGTACGGGCACTCy:GCrACACCGTCCGCCTACCCCGCAACGGCTAGGCCACCCT^ 

GACTGACCKAACGAAGGCACCGATGACGCCCAAGCAOCAATTCCCCCTACGCGXTCTGGT 

3181 ^ K , , _^ 3240 

CTGACTGCGTTGCTTCC 



CGAGATCAGGCTG(X»CA(aTCTCaKX:CAGTTCGGCCrrGGTCr<^^ 
3241 K ^ ^_ , ^ ^ 330P 




H 1 * 33€0 

CCGGTACTCCCTACCGCGGGACAGGGCCOCCTTTGCauUM5CeCGGCACGACtAOQACTX 

GGTCCCCGAAAGCTCGGGCGGGGTCTGCGATGCCATarrCGXTbCCCCCTGCOCCWTC^ 

3361 + ^ * ► 3420 

CCAGCCGCrrTCGAGCCCGCCCO^GACCCTACGCTACCACCTACGCCCGA^^ 

GATGCrrCCATGCCGCATCGCTGATCTTCGACGACATCCCCTCCATGGACCATGCa 
3421 + + A , , K 3480 

CTACCAGGTACGGCGTAGCGACTAGAAGCTGCTGTA<»CGACGTACCrOCtACCGTCCTG 

CCCTCGCGGTCAGCCCGCCACCCATGTCQCCCATCCKaSAGGGGCGCGC GG I GC ^ 

3481 ^ + + , 1 ■ ■ I 3340 

GGCAGCGCCACTCGGGCGGTOGCTACACCCGGTACCGCTCCCCCC(K»CCACGAAC^ 

3541 + 1- H 1= 1 ■ ■ -f » 3S00 

CTAGCCGGACrrAGTCGCTCCGCTACGCCTAAAACCCCCTCCCCGCCCCOC<WTGCGGCCT 

TCAGCGCGCAAGGCTCGTCGCATCCATGTCGCCCGCGATGGCACCCCTGCGGCTGTOCGC 
3 601 1. y K K , H 3660 

AGTCGCGCGTTCCCACCAGCGTAGCTACAGCGCGCGCTACCCTGGCCACCCCGACACCOC 

AGGCCACWATCTGCACCTCXIACGCCCCCAAGGACGCCCOCCKKSATCGAACtnWiC^^ 
3 661 + + ^. ^ ^ J. 3,2Q 

TCCCGTCCTAGACCTGGACGTGCGGCGCTTCCTGCGCXIGGCCCTAGCTTGCACrrC^^ 

CCTCAAGACCGGCGTGCTGTTCGTCCCGGGCCTCGAGATGCTGTCCATTATTAAGGGTCT 
3721 + + + 1. , H 3780 

CGAGTTCTGGCCGCACGAaKAGCAGCCCCCGGAGCTCrACGACAGGTAATAATTCCCACA 

GGACAAGGCCGAGACCGAGCAGCTCATCGCCTTCCGGCCTCAGCTTGGTCGGCTCTTCCA 
3781 -I- — — +— . 4. ■ ^. • ¥ 3840 

CCTGTTCCGGCTCTGGCTCGTCGACTACCGGAAGCCCCCAGTCGAACCAGCCCAGAACGT 

GTCCTATGACGACXTGCTGGACGTCATCGGCGACAAGGCCAGCACCCCCAACGATACOGC 
3841 + -t. + 1. ► 3900 

CAGGATACTCCTGGACGACCTGCACTAGCCGCTGTTCCGCTCCTGGCCGTTCCTATGCCC 

CCGCGACACCGCCGCCCCCCCCCCAAAGGGCGCCCTGATCGCGGTCGGACAGATGGGCGA 
3901 * + , , ^ » 3960 

CGCGCTGTGGCGGCGGGGGCCGOGTTTCCCGCCGGACTACCCCCAGCCTGTCTACCCCCr 

CGTGCCGCACCATTACCGCGCCAGCCGCGCCCAACTGGACCAGCTGATGCCCACCCGCCT 
3961 + + ^ 1, -4. 4020 

CCACCCCGTCGTAATCGCGCCCTCGGCGCCCGTTGACCTGCTCGACtACGCCTGGGCCOV 
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r\^r ^4/5 

CTTCCCCCCCGGGCAGATCGCGGACCTGCTCGCCaKrCTCCTCCCGCATGACATCCGCCG 

4021 + * -t- -t »• ». 4080 

CAAGGCGCCCCCCCTCTAGCGCCTGGACCACCGGGCGCACGACGGCCTACTGTAGGCGCC 

4081 "t- 1- * 4140 

GTCCCGGATCCGCGCGCCAGCCCACGTGTCCGGCAGCGCCGACTAAACCGGCGGCGCGTC 

GCGCGATGCGGCCGCGTCCAAGCCTCaK:CCGC«GAAGCCCGATCTTGG^ 

4141 + 1- + -+ > ■ " 4200 

CGCCCTACGCCGCCGCAGGTTCGGAGGCGCGCGCTCTTCGGGCTAGAACaSTCGGAACCr 

CGTGCTG^TCCGCTGCCGATACCCCTCCGGCCCACCCrcrcCCGAXGCGCGTCC^^ 
GCACGA CTAGGCGACCGCTA * ^ ' »• 4260 

GCGATAGATACCCAGCGCGGCGGCGATCGACCACGCGCAGGGCGGCGGCAGATGCGGAJU; 

4261 K 1 1 1 ■> 4320 

CGCTATCTATGCGTaXXK:CGCCCCTAGCTOCT<XMarrCGCGCCGCCC»^ 

CCCCTGCCCCCCCGAGGCATAATAGGGCTCGGCCGCGTCAXGCAGGCGGATGATGA^^ 

4321 ^ 1 1~: h 1 4380 

GGGGACGGCCCGGCTCCGTATTATCCCGAGCCOGCCCAGrrCGTCCOCCTACTXCrGC^ 

4381 + + — ^ fffflff^^fff^fl^ 4440 

TATCrCGCCCAGGCTTCCGTGGCCTGGGAOTTGGCAGCGGGGGCGGASCCGGTCGGTCAG 

GGCAGGCACATAGCAGCGCXCGATGGCCGCATCCTCGATCACGTCGCIMUXW^ 

4441 H >— - » — I ' ■ » 4500 

CCCrCCGTCTATCGTCGCGGGCTACCGCCGTAGCAGCTAGTGCAGCOCTCGCTACAACCA 

CAGCTGGAACGCAAGGCCCAGATCGCAGGCGCGATCaUXrACCGCATCGTCCTGCACGCC 

4501 -+ + h K- 1 ■ ■» 4560 

CTCGACCTTGCGTTCCGGGTCTACCGTCCGaxrrACGTCGTCGCGTAGCAGGACCT{»» 

CATCACCCGCGCCATCATCAC6CCCACGACCCCCGCGACGTCGTACGAATATTCCAGCAC 

4561 — + + + i— 1 h- 4 620 

GTAGTGGGCCCGGTAGTACTGCCGGTGCTGGCCCCCCTGCACCATCCTTATAAGGTCGTG 

GTCATCCAGGCTGCGGTATTCGCCATCCCCGACATCCATCGCGAAACCCTCGATCXGGTC 

4 621 i- + + h ¥ ¥ 4 680 

CACTACGTCCGACCCCATAAGCGCTAGCCGCTGTAGGTAGCCCTTTGGGAGCTAGTCCAG 

CATCGCCCAAAGGTCCGGGAAATCATGCCCCCGGGCGACCTGGCGCAGCGCCGCGAACGG 

4 681 + + 4. -+ 1. 4 740 

GTACCCGGTTTCCAGGCCCTTTACTACGGCGCCCCGCTGGACCCCCTCGCGGCGCTTCCC 

CGGCGACATCGGGCCGTCCTCGTGCAGCGCGCCCACCCTGTCGGCGCGCAGCGCCCCCAG 
4 74X ^ ^ ^ ^ ^ — » 4800 

GCCGCTGTAGCCCCGCAGGAGCACCTCGCGCCGCTCGCACACCCGCGCCTCGCGGCGGTC 

CCCCGCCTCTGGGTCCCCCCCCGCCTCGCCCCCAGAACCCATCACCTCCCCCTCGATCAC 

4801 * * + — + — + 4860 

GGCGCGGACACCCAGCGGCCGGCGGAGCCCCCGTCTTGGCTAGTGCACGCGCAGCTACTG 

GTCATCCGCATCCCTGCACCAGGCATAGAGCArCACCGTATCCTCGCGGATGCCGGGCGG 

4861 f f + + ». ¥ 4920 

CAGTACGCGTACGGACCTGGTCCGTATCTCGTACTGGCATACGAGCGCCTACGGCCCGCC 

CATCAGCTTGGCCGCCTGCCCGAAGCTTTCCGAACCCTCCGCGATGCCCGCTTCOGAACT 

4921 * ¥ + ¥ ► 4980 

GTAGTCGAACCGGCGGACGCGCrrCGAAACGCTTGGGACGCCCTACCCGCGAACCCTTCA 

CGCCGTCAGATCCGTCATGCGACGCCCACGTCCCACAGCATGACCTOCGCCGTGGCCTTG 
4981 * * ¥ + 1. + 5040 

GCGCCACTCTAGCCAGTACGCTGCCGGTCCAGGCTGTCCTACTCGACCCCGCACCCGAAC 
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GCGCTGCCAACGACACCCa;C»TGCCCGCACCCGGATGCGTGCCCGCCCCCACGXTGTAC 
CGCGACGGTTGCTCTGGGCCCTA^ ^ " ^^^'^ 

aagttcgcgatcgcgccgtcgcggttatgccggcgcaaccagcc<k:attck»tcacgatc 

5101 + ^ + + y 5160 

TTCAAGCCCTAGCGCCCCAGCGCCAAtAa^CGCCTTGGTCCGCCTAACGCJUrrCCl^ 

GGCTCGACCGAGAACGCGCTGCCCTGATGCCCCGACAGTTCGGTGCTGAAATCGOCGGGG 

51 SI ' * 1 ► ■■ * ' » 3220 

CCGAGCTGGCTCTTCCGCGACGOCACTACCCGGCTGTCAAGCCACGACrrrAGCCCCCCC 

CTGAAGAffGCGGCTCACGGTCAGGTGCTTGCGCAGGTCCCGCATGCCCCCGCGCrCCAGT 

5221 + + K 1 1 r-f 5280 

GACTTCTACGCCGACTGCCAGTCCACGAACGCGTCCAGCCCCTACCGCGCCCOG3VGCSTCA 

TCCTCGAAGATGCCCTCGCCATAGCCCCGGGCCTCCGCTTCCCAATCGACATCCGCGC^ 

S291 1. ► H ► 5340 

ACGAGCTTCTACGCGAGCCGTATCGCGCCCCGGAQCCfflUlGGGTTXQCTCTAGCCCOCCC 

CCCAGATCCCGAACGCGCGCAAGCACGTAATGCGTGGACATCCCCTCGGGGCCCACGCTG 

5341 + ► -I—, 1 > ■ ■» 5400 

GGGTCTACGCCTTGCCCGCGTTCCTCCATTACGCACCTGTAGOGGAOCCCCCOCTCaa^C 

GGATCGGTCACGCAGGGCGAATGCAGATACATCGAGauCATCCTCCGGCACCCCrGGCCCG 

5401 »• * + 1- 5460 

CCTAGCCAGTGCGTCCCGCTTACCTCTATCTAGCTCTTTAGCACGCCGTCCGCXCCOGC^ 

TTGAAGATCrCGTTCACCACCCCCTTGTAGCGCCGGCCGAAGATGACGCTCTGGTC GC CC 
54 61 ■ +— ™ + — — I I 5520 

AACTTCTAGAGCAAGTGCTCOGGGAAOVTCGCGCCXGCCTTCTACTGCGACACCA^ 

AGC7rTCTCGGGCCGCTTGGACAGGCCGAAATCCAGCACGAACAGCCACATCGACCA« 

SS21 H + »■ -+ 1 k 5580 

TCCAAGAGCCCCGCGAACCTGTCCGGCTTrACGTCGT G CrrulVi^TGT A GCTGgTCGCG 

TGCCGCTTCAGGATCGCGGCCTTGGTGCGCCCGCGGCGGCTATGGCCCAGCACCTCGCGA 

5581 f ' f i. ► , ». 5$40 

ACGGCCAAGTCCTAGCGCCGGAACCACGCGGGCCCCGCCCATACCGOGTCGTCCACCGCX 

TAGCTCTCCATCACGTCGCCGTTGCTGGCCACCCTATCCGCOCGCAACTGCCCCCCGTOC 

5641 + 1. 1. 1- 1 h 5700 

ATCGACACGTAGTGCAGCCGCAACGACCGGTGGCATACCCCCGCCTTGACGGCCGCCAOG 

ACCAGCCTGACGCCCGTGGCGCGATCGCCCTCGCTGTCGATCCGCCTGACGCCCGCATTC 

5701 + + + h ». : K 5760 

TCCTCGCACTGCGGGCACCGCCCTAGCGOGAGCCACAGCTAGGCGCACTCCCCCCGTAAC 

AGCAGCAGCGTGCCGCCAAGACGCTCGAACAGGGCGACCATGCCCGCGACCAGCTGGTTC 

57 61 + + ^ > 1. K 5820 

TCGTCCTCGCACCCCGGTTCTCCGACCTTGTCCCGCTCGTACCCGCGCTGCTCGACCAAC 

GTGCCGCCCTTGGCCAACCAGACGCCGCCGCGCCGTTCCAGCGCATGGATCAOCGCATAG 

5821 1- + h + 1. h 5880. 

CACCGCGCGAACCCCTTGGTCTGCCCCGGCGCGGCAAGGTCGCGTACCTAGTCGCCTATC • : 

ATCGAGCTGGTCGAAAACGGCTTCCCGCCGACCAGCAGCGTGTGGAACGAGAAGGCCTCC 

5881 * + ^ + 1. 594a 

TACCTCGACCACCTTTTCCCCAAGGGCCCCTGGTCGTCCCACACCTTGCTCTTCCGGACC 

CGCAGATGCGGGTCCTCGATGAAGCGCGCCACCATGCTGTGGACCGAGCGCTATGCCTGC 
5941 + ^ ^ ^ ^ ^ 6QQ0 

GCGTCTACGCCCAGGACCTACTTCGCGCGGTGCTACGACACCTGGCTCGCCATACGGACC 
ACGCGCATCACCGCCGGCGCGCCGTTCAGCATCTGGCCCACCTTCACGAACGCCGTGGTC 
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CCCAGCTTCAGATACCCCTCGCGATAGACCTCCTCGGCCTAATCGTGGAAGCGGCGATAG 
6061 • > K ». — — — 6120 

gggtcgaagtctatggggagcgctatctggaggacccgcarraccaccttcgccgctatc 

ccatcgacatcggcgggattgaaggaggcgacctggcggat» 
6121 — + — ■ 5£8q 

GGTAGCTGTAGCCGCCCTAACTTCCTCCGCTGGACCGCCTAGTCGAGCAGCAGCACCAAC 

ACCTATTCGAAGCTGCGCCCGTCCGCCCATGTCAGCCGGTAGAAGCCCGAGACCGGC3U5C 

6181 + — + k 1— I ► 6240 

TGCATAAGCTTCGACCCCGCCAGGCCGGTACACTCGGCCATCTTCCCGCTCTGCCCGTCC 

AGCGTCACGrCACGCTCCATCGCTTGGCCGCTGAOKXCCACACCTCTCCCAGG^ 

6241 + + — — I ■ ► 6300 

TCGCAGTGCAGTGCGAGGTAGCCAACCGGCGACTCCa»7rGTCGAGAGCGTCCGACXGC 

GCGTCGGTCACGACCGTCGGGCCTGCArCGAAGACCmGCCCTGATCGTTCCA^ 
6301 h h + K „ 1. 6360 

CCCAGCCAGTGCTGGaU^CCGGACGTAGCTTCTGau:CGGGACTAGa^^ 

GCGCay:CGCCGGGCTTGTCGCGGGCCT^ACGATGGTGGTCGCGAYGCCC3^ 

6361 + -I- 1 I ■ 6420 

CGCGCCGGCGGCCCGAACAGCGCCCGGXGCTGCTACCACCAGCGCTACGCCCOGCT 

AGGCGGATGGCAAGCGCAAGCCCGCCGAAAOrtXSaXXZGATGACGAt^ 

6421 + 1- 1- 1- 6480 

TCCGCCTACCCrrTCGCGTTCGOGCGGCTTTGGACGCGGCTACTGCTACCGCCTTGAGXAC 

CTCTCTCCTGCAGCAGGGGGCGTTCGGGCAGGCAGCGCACGGCCTGCGACAGCGGAATGG 

6431 1- — — — ► + 1 I 6540 

GAGAGAGGACGTCCTCCCCCCCAAGCCCGTCCGTCGCCTCCCGGACGCTCTCGCCrEACC 

GCGGGCGTCCGGTGACGATGCGAAGCCGGTCGGCCAATGTCAGGCGCCCGGCATAGAAGC 

6541 — — 1- K — + ► « ■ i 6600 

CGCCCCCAGGCCACTCCTACGCTTCGGCCAGCCGGTTACAGTCCGCOGCCCMAT^ 

GCTCGATCAGCGGCTGCGGCAGGCGGTAGAACCGCTGCAGCAGGCGATAGCGACGGTCG6 

6601 H * +— K 1 h 6660 

CGAGCTACTCGCCGACGCCGTCCGCCATCTTGGCGACGTCCTCCGCTATCGCXOCCACCC 

GCGCGCACCCGCGGAAa^GCATCCGCTTCAGCAGCCCCAGGAACCCGTCGCGATCCOCOC 
6661 — + + — ■ ■ - f ■ - - » 6720 

GATCGATGGCCCAGCCCCGCACCGCGCGACGGCCGGACGCGCTCCTCAGCTCGCGCGCCC 

6721 + : — + 1 > i 6780 

CTAGCTACCCGCTCGGCGCGTGGCGCGCTGCCCGCCTGCGCCAGCAGTCCJU3CGCGCGCC 

CGATGGCATCCGCGACCrGCGCGGCATAGCGCAGCGAATATCCGCTGACGGCGTGGAACA 

6781 + + 1- 1- 6840 

GCrACCGTAGGCGCTGGACGCGCCGTArCCCGTCGCrrATACGCCACTGCCCCACCTTGT 

GCCCTGCCCCCAGCCCAACCGGCACCGCCCCCTGCGCGTGCTCCCGCCAGAAGCCTATGG 

6841 + + +— h > 6900 

CGGGACGGGGGTCGGGTTGGCCGTGGCGGGGGACGCCCACCAGCGCGGTCTTCGGATACC 

CGTCATCGGCCAGCGCGATGCGCAGGATGCCCCTTTCCCGCCGCATCTCCTCCCCCCTCC 

6901 + + K . 6960 

GCAGTACCCGGTCGCGCTACCCGTCCTACGGGGAAAGCGCGGCGTAGAGCACGGGCCAGC 

ACCCCCGCCTCGCCGCATAGTCCAGCGACGCCTGCGCCAGCGCGCCATCCTCCAGATCGC 
6961 * + + + — + 7020 

TCGGCGCCGACCGCCCTATCAGG7CGCTGCGGACGCGGTCGCCCGGTAGCACGTCTACCC 
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Fi». 24/R 

CGCCGTCGCTCTACCGCCTATCCTCGATCACGATGCtSGGTGCGACTGAAGOGCACCAGAX 

7021 + + ♦ + —I K 7080 

GCGGCACCGACATCCCGCATACGACCTAGTCCrACGCCCACXCTGACrrCCCCTCCTCTA 

AGATGAACCGGTACCCCTCCATC7GCGGAACCGTCGCGTCCATCATCATCGGGCGCTCC5A 

7081 + * i. h > 7140 

TCTACTTCGCCATGGGCAGGTAGACGCCTTGCCAGCCKyVGCTACTAGTAGCC^ 

CGCCATGGGGGGCCTCGCTCTC<»TCTCGACCCCCACCAATTTCTG<aUACCCACGGT^ 

'^^^ GCCGTACCCCCCGCAGCCAGAGCT ' ^ 

GGTGCGGCGTCTCGACGGC»CCACGG<X:GTCGATCA<MCAGGCAOCCrCCaTCCCCGA« 

7201 + + ► ^ 1 ■» 7260 

CCACCCaZCAGAGCTGCCCTGGTCCCCGCAGCTAGTGCGTCCGTOGGAOCTXCGCCCTCC 

<OTCCCTCAGCCTCGCGCCGGTATCGTCCA<XGTCGCCACATOCCrrATTCCAOCGCAGM 
7261 1" ^ ». 1 , 7320 

GCAGGCAGTCGCAGCGCGGCCATAGCAGGTaKIAGCGCTCTACGCAtAAGGTOGOGTCn 

CGACACCCTGCAGCAGCCCCAXaVGCGCGCCCGCCTCGATCGAGCCATACCCT^^ 

7321 + 1- —t 1 ■ . » 7380 

GCTGXGGGACGTCGTCGGGCTAGtCGCGCGGGCGGAGCTA(XrrCGGTATCGGACAG^^ 

GGCGGCGCGAATGGTCGGGAAACGCGACCTCCTGATCCGTCCArrCGCCGCCTCS^ 
7381 — + 1 1- 1 f 7440 



GCGACAGGCGCGCCAGCC»TTCGGGCGAAAGATCCCTCTCC?rGGCACGACCAGCTCTGCT 

74 4X 1- + 1 1 ♦ 7300 

CGCTCTCCGCGCGGTCGCTAAGCCCGCTTTCTACGCACAGCACCCTCCTG^^ 

GGTCCGACGGGCCGGACCGCCCGTCGACCATCACGATGCCCQCATCC CC TCTcyag^ 

7301 1- + k h 1 I 7560 

CCAGGCTCCCCGGCCTGGCGCGCAGCTCGTAGTCCTACOCQCCTACGCCAGACCXCACM 

GAACGGCAACCGCGATCAGCCCACCGGACAGCCCCCCGCCCGCGATCAGCACATCATCGC 

7561 + + (. 1— 1 1- 7fi20 

CTTGCCGrrCGCGCTACTCGCGTGQCCTGTCGGCGCGCGCQCCCtAGTCGTC tA CTACCG 

TCATGTATTCCGATCCGCCCCTTCGCC G T C C rT C AGCAGCGCGCCCGAQC G TTTCAGCTC 

7621 + + h ^ 1. h 7680 

AGTACATAACGCTAGGCGGGGAAGCGCCAGGAAGTCGTCCCGCGGGCTCGCAAAGTaSA^ 

TGCCTTGAGGCTGTCGACCGAGGGCGCCCAGATGAAACCGAAGCTGACGCXGTTCXCGCC 

7681 — — ■ — — > I 7740 

ACGGAACTCCGACAGCTGGCTCCCGCGGGTCTACTTTCGCTTCCACTGCGTCAAGAGCGC 

GCCATCGACCGCGTGATGCATCCTGTGTGCCTGCTAGACCCGACGAAGATAGCCGCOCTT 

7741 + + k ■ !■■ . .. ■■«■ 7800 

CGGTACCTGGCGCACTACGTAGGACACACCGACCATCTGC GC r GC t TCTATCGGCGCGAA 

GGGGACATAGCCGAACGGCCAGCGCCCATGCACCAAGCCGTCATGCAGGAAATAGTAGAT 

7801 -r + 1. h 7860 

CCCCTGTATCGCCTTGCCGCTCCCGGGTACCTGCrrCGCCACTACGTCCTTTATCATCTA 

CAGCCCGTAGCAGGTGACCCCCACCGCCAGCCACCAGGCCAGATCCCACCCCATCGCCCC 

7 861- + -I- h + 1. 7920 

GTCGGGCATCGTCCACTGGGGGTCGCGGTCCGTGGTCCGGTCTAGGCTGGGCTAGCGCCG 

GATCGCGAACACCACGATCGAGATTACCCCGAAGATGACGCCATAGAGGT CG r rC TT C TC 

7 921 + + + + 1 h 7980 

CTAGCCCTTGTCGTGCTAGCTCTAATGGCGCTTCTACTCCGCTATCTCCACCAAGAAGAC 
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798X ' +— * 1- 1 K 8040 

CTCGCGCACCAGCACTACGAGCAGCACCACGCTAAATACGCTCCXXMTCCCGTCCCCCCC 

ATGCATGATCCACCGATGGACGGAGTACGCCGTCACCTCCATCCCGGCGACGGTCAGGAT 

8041 ■»■ * — — — »■ — h ■ ■ ■ I I 8100 

TACGTACTAGGTGGCTACCTGCCTCATCCGGCAGTCGAGGTAGCGCCtSCTGCCAGTCCTA 

GACCGTCAGGATTGCGCCCCAAGTCCTCATGCCGGCCC X TT G CrTCATATGAaUXXaUU: 

8101 ^ f + + 4 ► 8160 

CTGCCAGTCCTAACGCCGGGTTCACCAGTACGGCCGGGGAACGAACTAXACTGTCCCTTS 

AGGCTXCSCTGCCGCGCCGTGCATGACCAGCCCATCGGGCTGCCACCAAXGCOCATOGCC 
8161 — — + + ». ■ — ■» ., „ , »■ , , 8220 

TGACATCTGCCTTCAGGGCTCATAGCCGGATCATCCCTtaCATTCCXXGCXGAACCWC^ 

8221 -t" ♦ + 1 t ' ¥ 8280 

ACTGTAGACGCAACTCCCGACTATCCGCCTAGTACCCACTCTAAOCGGOCOCrrOCOCCO 

AGGCGCATCACCCGTTCCGTCCCTGGAAATArrAATCTTTTCCC<aUWSATGCTCOC»^ 

TCOGCG TACTQC GCAAG^ *^ ' " ' * 

AGACGATTCGAACCTCCGACCTACGGTACCCAAAACCCTCCCOCXACCJUKCTGa^ 
8341 +- + ► I t. 8400 

TCTCCTAAGCTTGGACGCTGGATCCg^TGCt^f (■ i i ^fQCAgCffCCAT'SgTCTGACgCgATO 

GCCCCGACTGCGGAACGCTTTA<XCCATTGTTCCGGCAACGCaUUVGACCtAGT^ 

8401 + + + 1. 1 ». 8460 

CGGGGCTCACGCCTTCCGAAATCGgCTAACAAGGCC Ul l UU, l'i' H- l X. GATCACCGTCCG 

CAGGACCGCATTGTCCCCCATC<XCCCATGCGCCATCCGCTGACCCCCCrrCAGGCCA^ 

8461 + + + y 1 t 8320 

GTCCTGGCCTAACACCGCGTACGCGCCTACGCGGTAGCCGACTGGCCCGAAGTCCCGTTC 

GCGATCCGCCTCTCCGCCCGCGATTTCGACGACGAACaGCCCCTCXOGCTCCCGAXCG^ 
8S21 + h + 1 , ► 8580 



8581 + + + — t— — ,,. ■- — f 8640 

CTGGCGGCGCGGCCCTTACCCGCAGAGCACKrrCGCCCGCGCGTAACGCCACCTACACXCC 

GATGACCCCGCTTTCATCCCCAAAGACCATGrcaiGCGGGATCAGTGTCrTGCGCATCa 

CTACTGCGGCCAAAGTAGGCGTTTCrrGCTACACGTCGCCCTAGTCACACAACGCGTAGCT 

GAAGGACACCGCCTGGGGCGATTCGTAGATCAACAGCATTCCOCTGCCCCCAGCCACCTC 

CTtCCTGTGGCCGACCCCGCTAAGCATCTACTTGTCGTAAGGCCACGGOCGXCCGTCGAG 

CTTGCGGAACATCAGGCCCTGCGCGCGCTCTTCGGGGCTGTCCGCGACCTCGACCCGAAA 
8761 + 4. ^ ^ _^ ^ 8820 

GAACGCCTTGTACTCCCGGACGCCCGCGAGAAGCCCCCACACOCGCTCGACCTGGGCTTT 

CCCGAGCGTTTCCCCACCGGTATCGACGACAAGACTGCCCGGCCCCCATTCCACCCCCGC 

8821 * ♦ + ► 1- 1- 8880 

GGGCTCCCAAACGCGTGGCCATACCTCCTGTTCTGACGGCCCGCGCGTAACCTCGCOCCG 

CGCGGCGGCGGGCATCAGGACCGCAACAAGCGCTGCGCCCTTACTCCCCCACATOCGCAA 
8981 + — _ 1. ~+ 8940 

GCGCCGCCGCCCGTAGTCCTGCCGTTCTTCCCGACGCCGGAATGAGCCGGTGTACCCGTT 

GATAGGACTGC7CCGCGCCGAGATCCCCCGGGCTCCAGGAATTCCATATCAAGCTTATCG 
8 941 : ♦ + + + 9000 

CTATCCTGACGAGCCGCCGCTCTAGGGGGCCCGACGTCCTTAAGCTATACTTCGAATACC 
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ATACCGTCGACCTCGAGGGGGC»:CCCCTACCCA GC TTT ruriC CCrrTAGTGAGGGTTA 
9001 + + + ► , i, 

TATGGCAGCT GG AGCTCCCCCCCGCGCC ATCGGTCGAAAACAAGGGAAATCACTCCCAAT 

ATTGCCCGCTTGGCGTAATCATGGTCATAGCTGTTTCCT G T G TGAAATTGTTATCCGCTC 
90€l ♦ + w 

TAACGCGCGAACCGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAG 
ACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTaGGCTGCCXAATGA 



9060 



9120 



9121 



TGTTAAGGTCTGTTGTATGCTCGGCCTTCGTATTTCACATTTCGGACCCCACGGAXTACT 



9180 



9181 



CACTCGATTGAGTGTAATTAACGCAACGCGACTGACGGGCGAAXGCrraUSCCCrrTGGAC 



9240 



9241 K K ». > ■■ ' » » 9300 

AGCACGGTCGACGTAATTACTTAGCCGGTTCCGCGCCCCTCTCCOCCAAACGCAIAACOC 

CgCTCTTCCCCTTCCTCGCTCACTGACTCCCTGCGCTCG G r CG rT CUUC TGCGGCSACCC 

9301 K . I K 9360 

CCGAGAAGCCGAAGGAGCGAGTGACTGAGCGACGOGAOCCAGCAAGCCGACGC^ 

GTATCACCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCACCCGATAACGC^ 
9361 + - — + K 1 1 1- 9420 

CArAGTCGAGTGAGTTTCCGCCATTATGCCAATAG G TGICTTAGTCCCCT A rr GCG T C Cr •> 

» 

AAGAACATCTGACCAAAAGGCCAGCAAAAGGCCAGGAACCCTAAAAACGCCCCGTTGCTC 
942X — »- ^ + , u| 1. 9480 

TTwA iACACTCGTTTTCCGGTCGTTTTCCGGTCCTTGGCATTTIT CC GgCgCAACGAC 

GCGTTrrrCXATAGGCTCCGCCCCCCrGACGAGCATCAaUUUUlTCGAC^ 

9481 + ~+ + » ¥ ' ' • " ¥ 9S40 

CGCAAAAAGGTATCCGAGGCGGGGGGACTGCTCGTAGTGTTTTTAGCTGCGAGTTCACTC 

AGGTGGCGAAACCCGACAGCACTATAAAGATACCAGGCGTTTCCCCCTGCAA GC T C C C T C 

9541 — *— ^ 1- 1~ 1 »• 9600 

TCCACCGCrrrGGGCTGTCXTCATATTTCTATGGTCCGCAAAGGOOGACCTTCGACOGM 

GTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTCTCCGCCTTTCTCCCrrCC 

9601 ■ ■ ■ ■ ■ »~ — ™— — . . ( I I 9660 

CACGCGACAGGACAAGGCTCGGACGOCGAATGGCCTATCCACAGGCGGAAACACGGAAGC 

GGAAGCCTGGCGCrTTCTCATAGCTCACGCTCTAGCTATCTCACTTCCGTCTAGCTC^ 
9661 — ' ' + — + — , ,, - », » 9720 

CCTTCCCACCGCGAAAGAGTATCGACTGCCACATCCATACACTCAAGCCACATCCAGCAA 

CGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGrrCAGCCCGACCGCTGCGCCTTATOC 

9721 + + 4- + H h 9780 

GCGAGGTTCGACCCGACACACGrGCTTGGGGCGCAACTCGGGCTGGCCACOCCGAATAGG 

GGTAACTATCGTCTTGAGTCCAACCCGGTAACACACGACTTATCGCCACTGGCAGCAGCC 

9781 + + — * 4- -f ■ ■ ■ ■ » 9840 

CCATTGATAGCAGAACTCAGGTTGGGCCATTCTGTGCTCAATAOCGGTGACCGTCGTCCG 

ACTGGTAACAGGATTAGCACAGCGAGGTATGTAGGCGCTCCTACAGAGTTCrrGAAGTCC 
9841 + + ^ 1_ J. 9900 

TGACCATTGTCCTAATCGTCTCCCTCCATACATCCGCCACGATCTCTCAAGAACTTCACC 

TGGCCTAACTACGCCTACACTAGAAGGACAGTATTTGCTATCTGCGCTCTGCTGAAGCCA 
9901 + ♦ ^ 4. + 9950 

ACCGGATTGATGCCCATGTGATCTTCCTGTCATAAACCATAGACGCGAGACGACTTCGGT 
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GTTACCTTCGGAAAAACACrrTCGTAGCTCTTGATCCCCCAAACAAACCACCCCrGGTACC 

9961 + + + -i 1- 10020 

CAATCWAACCCTTTTTCTCAACCATCGAGAACTAGGCCGTrrCTTTGGTGGCGACCATCC 

GGTGOlTTTTTTCrrrrGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAACAT 

1002 L *- + 1- i 1 ■ * 10080 

CCACCAAAAAAACAAACGTTCGTCGTCTAATGCCCGTCTTTTTTTCCTAGA G TTCTT C T A 

CCTTTGATCTTTTCTACGGCGTCrGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATT 

10081 + + ■ ■ t 10140 

GGAAACTAGAAAAGATGCCCCAGACTGCGAGTCACCTTGCTTTTGAGTOCAATTCCCTAA 

TTGCTCATGAGATTATCAAAAAGGATC7TCACCTAGATCCTTTTAAATTXAAAATCAAGT 

10141 1- + 1- 1 - 10200 

AACCAGTACTCTAATAGTTrrrCCTAGAAGTGGATCrAGOAAAATTTAArrrTTACTTCX 

TTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACACTTACCAATGCTTAATC 

10201 -+ ► >- ■ ■ > I »• 102 «0 

AAATTrACTTAGATTTCATATATACTCATTTGAACCACACTCTCAATGGrTXC^^ 

ACTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAaTTOCCTGA^^ 

10261 •»■ + - — ——I 1 ♦ 10320 

TCACTCCGTGGATAGACTCGCTAGACAGATAAACCAAGTAGGTATCAACGGACTGAGCOC 

GTCGTCTAGATAACTACGATACGGGAGGGCTTACCATCa^WCCCCACrrGCTGCAATGA 

10321 1- + ^ * ! * — I I 10380 

CAGCACATCTATTGATGCTATCCCCrCCCGAATGGTAGACCGGGGTCACGACGTTACTAT 

CCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAAgAAACCAGCCACCCGGAAGG 

10381 4, »■ » ■ ■» 10440 

OCCCCTCrGCGTGCCACTCGCCGACCTCTAAATAGTCGTTATTTGGTCOQTCOCCCTTCC 

GCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAgTCTArTAAI t O TT GC 

104 41 — + + 1 i h 10300 

CGGCrrCGCCTCTTCACCAGGACCTTGAAATAGGCGGAGCTACGTaiGATXATTAAC^ 

CCCGAAGCTAGAGT AAGT AGTTCGCCAGTT AATAGTTTCCQCAAC G X T Ul TG OCATTCCT 

10501 + * + ,.»■,■■-„■ t , ■■ ■ » 10360 

GCCCTTCCATCTCATTCATCAAGCGGTCAArrATCAAACCCGTTGCAACAACGGlAAOT 

ACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGttTTCC X ^ 

10561 + + + ► , I ■» 10620 

TCTCCGTAGCACCACACTGCGAGCAGCAAACCATACCGAAGTAAGTCGACGCCAACGCTT 

CCATCAAGGCGACTTACATGATCCCCa^TGTTGTGCAAAAAACCGGTTACCTCCTTCGCT 

10621 + + 1- — ^ ■ , - 10680 

GCTAGTTCCCCTCAATGTACTAGGGGGTACAACAC G TTrrTTCCCCAATCGAGGAAGCCA 

CCTCCGATCGTTCTCAGAACTAAGTTGGCCGCAGTGTrATCACTCATGGTTATGGCAGCA 

10681 * -+ 1- + 1- 10740 

GGAGGCTAGCAACAGTCTTCATTCAACCGGCGTCACAATAGTCAGTACCAATACCCTCGT 

CTGCATAATTCTCTTACTGTCATGCCATCCGTAAGAtC C TTTTCrrGTGACTGCTGACTAC 

10741 + -i- ^ ♦ f 1. 10800 

CACGTATTAAGACAAT'3ACAGTACGGTAGGCATTCTACGAAAACACACTCACCACTCATC 

tcaaccaactcattctgag;ijvtactgtatccggcgaccgagttcctcttgccccgcgtca 

1080L * + + + ^ 1. 10860 

agttggttcagtaacactcttatcacatacgccgctggctcaacgagaacgggccgcact 
atacgggataataccgcgccacatagcagaactttaaaactgctcatcattcgaaaacgt 

10861 + + * ♦ 10920 

TATGCCCTATTATCGCGCGGTGTATCGTCTTGAAATTTTCACGAGTAGTAACCrrrrCCA 

TCTTCGCGGCGAAAACTCTCAAGGATCTTACCGCTGTTCAGATCCAGTTCGATGTAACCC 



AGAAGCCCCCCTTTTGAGAGTTCCTAGAATGGCGACAACTCTACCTCAAGCTACATTCGG 
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ACTCGTGCACCCAACTCy^TCrrCAGCATCTTTTACTTTCACCACXXTTTCTGGCT 
10981 ♦ + i. ^. 1X040 

TGAGCACGTGGCTTGACTAGAAGTCCTrAGAAAATGAAAGTGGTCCCAAAGACCCACTCGT 

AAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATA 

H041 ^ * — ^ H , , 11100 

TTTTGTCCTTCCGTTrrACgGCgTTTT i T CCCTTATTCCCGCTGTGCCTTXACAACTTAT 

crCATACTCrrCCTTTTTCAATATTATTCAACCATTTATCAGGCrrrATTGTCTCATC^ 
11101 h * . ^ H ^ j^j^lSQ 

GAGTATGAGAAGGAAAAACTTATAATAACTTCCTAAATAGTCCCAATAACACAGTACTCG 

GGATACATATTrGAATGTArrrAGAAAAATAAACAAATAGGGGTTCCGCGCACArrrCCC 

11161 f — ~ ^ 1 t 11220 

CCTATGTATAAACTTACATAAATCTTTrrATTT G TT TA TCCCCAACGCGCCTGiaVAASGG 

CGAAAAGTGCCAC 



OCTTTTCACCGTG 
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5^ 



C3 
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CO 
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ca 
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CTTACCrraX^AAAAGACrrTCGTAGCTCTTGATCCGGCAAACAAACCACCCCTCCTAGC 
9961 ► + + + ^ 10020 

CAATGCAAGCCTTTTTCTCAACCATCGAGAACTAGGCCGTTTCrTTGGTGGCGACCATCG 
GGTGGTTTTTT^GTTTCCAAGCAGCAGATTACGCGCAGAAAAAAACGATCTCAAGAACAT 

10021 + ♦ 1- — — — * — » looao 

CCACCAAAJkAAACAAACGTTCGTCGTCTAATGCGCGT C TTTTTTTCGTAGA G rrcriCr A 

CCTTTGATCTTTTCTACGGCGTCTGACGCTCAGTGGAACCyUUUlCTCACGTTAAGGGATT 

10081 + + » ■ ■■^ 10140 

GGAAACTAGAAAAGATGCCCCAGACTGCGACTCACCTTOCTTTTGACTGCAATTCCCTAA 

TTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTrAAATTAAAAATGAAGT 

10141 — ♦ -+ 1 ■ -f * 10200 

AACCAGTACrrCTAATAGTTTTTCCrrA(3U^CTGGATCTAGGAAAATTXAArrT« 

TrtAAATCAATCTAAACTATATATGAGTAAACTTGGTCOTACACTTACCaUirG^^ 

10201 :-+- ■ + »■ » I 1- 10250 

AAATTTAGTTAGATTTCATATATACTCaiTTTGAACCAGACTqTaUiT ^ 

ACTGACGCACCTATCTCAGCGATCTGTCTATTTCgrTCATCCATA G TT GC CT 

10261 + + 1 1 1- 10320 

TCACTCCGTGGATAGAGTCGCTAGACAGATAAAGCAASTAGGTATCAACGGXCTGAGGOC; 

GTCCTCTAGATAACTACGATACGGGACGOCTTACCATCrGGCCCCAGTCCrGCAATGATA 

10321 »• + ♦ p» ■ ■ ■■ ■■■ > — 10380 

CAGCACATCTATTGATGCTATCCCCTCCCCAATCCTAGACCGGCGTCAC<a«:c^^ 

CCQCGAGACCCAaSCTCACCGGCTCCACATTTATCAQCAAgAAACCAGCCAGCCGSAACC 

10381 + + + ¥ < ». 10440 

CCCCCTCTCGGTGCCACTGGCCGAGCTCTAAATAGTCCTTATTTCCTCOGTCGCCCTTCC 

GCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCCCCTCCATCCAGTCrATT^ 

104 41 1 + + » , >■ 10300 

CGGCTCGCGTCTTCACCACGACGTTCAAATAGGCGGAGCTACGTC3W»TAArT3UlCAAC^ 

CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATA<rrTTGCMCAAarrTGTTCCCArr^ 

lOSOl + * + 1 > ■ * 10560 

GCCCrrCGATCTCATTCATCAACCGGTCAATTATCAAACGCCrrrCCAACAACGCXAACGA 

ACAGGCATCGTGGTGTCACGCrCGTCGTTTGGTATGGCrrTCATTCA<Xrrca X.T l 

10561 + + + K 1 ■ * 10620 

TCTCCCTAGCACCACAGTCCCACCAGCAAACCATACCGAAGTAAGTCGAGOCCAACGCjrr 

CGATCAAGGCGAGTTACATGATCCCCCATGTTCrGCAAAAAAGCCCTTACCTCCTTCGCr 

10621 + + 1- +— 4- k 10680 

CCTACTTCCGCTCAATCTACTAGGGGGTACAACAC U 1 1 C ' i ' Tl 'CGCCAATCGAGGAAOCCA 

CCTCCGATCGTTGTCAGAACTAAGTTGCCCGCAGTCTTATCACTCATCGTTATGGCAGCA 

10681 * H + + h 1- 10740 

GGACGCTAGCAACAGTCTTCATTCAACCGGCGTCACAATAGTGAGTACCAATACCGTCCT 

CrCCATAATTCTCT^ACTGTCATGCCATCCGTAAGATGCTt'TlCl'GrGACTCCrrCACTAC 

10741 + * > — — ► 10800 

GACCTATTAAGAGAATGAC^GTACGGTAGGCATTCTACGAAAACA^CTGACCACTCATG 

TCAACCAAGTCATTC7GAGAJVTACTGTATGCGGCGACCGACTTGCTCTTGCCCCGCGTCA 

10801 * + + + ^ — ^---i- 10860 

AGTTGGTTCAGTAACACTCTTATCACATACGCCGCTGGCTCAACGAGAACGGGCCGCACT 

ATACGGGATAATACCGCGCCACATACCAGAACTTTAAAACTGCTCATCATTCGAAAACGT 

10861 + 1- * — ■ , ,. ( ' 10920 

TATXSCCCTATTATGCCGCGCTGTATCCTCTTGAAATTTTa^CGACTACTAACCTTTTCCA 

TCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTCTTGAGATCCACTTCGATGTAACCC 



AGAAGCCCCaCTTTTGAGAGTTCCTACAATGCCGACAACTCTACCTCAACCTACATTGCG. 
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Fio^. 24/12 

ACTCCTGCACCCAACTCATCTTCAGCATCrTTTACTTTCACCACCCrrrTCTGGGTGACCA 
10981 + * ^ , ^ ^ j^j^g^ 

TGAGCACGTCGGTTCACTAGAACTCCTAGAAAATGAAACTCSGTCCCAAAGACCCACTCGT 

AAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAXGTTCAAXA 

11041 * — H ^ ^ 11100 

TTTTGTCCTTCCGTTTTACGGCGTTT t i'i'CCCTTATTCCCGCTCTGCCTrtACAACTTAX 

CTCATACTCrTCCTTTTTCAATATTATTGAACCATTTATCAGGCTTArrGTCT C ATGAGC 
II 101 ^ + , , „ 111 SO 

GAGTATGAGAAGGAAAAAGTTATAATAACTTCGTAAATAGTCCCAAXAACAGACXACTCC 

GGATACATATrrGAATGTATTTAGAAAAATAAACAAATAGGCGTTCCGCGCACATTTCCC 

mSl . . " ^ . 1 . 11220 

CCTATGTATAAACTTACATAAATCTTTTTA i I ' i ' Gll ' tA TCCCCAAGGCgCGTGXAAAGGG 

OGAAAAGTGCCAC 



OCTTTTCACGGTG 
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